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ABSTRACT 

This  project  deals  with  the  development  of  scaling  laws  for  multiscale  simulations  from 
atomistic  to  continuum,  via  mesoplasticity  and  applied  for  such  material  testing  techniques  as 
tension  and  indentation.  The  main  objective  is  to  address  some  of  the  critical  issues  involved  in 
multiscale  material  modeling  -  theory  and  simulation.  The  following  specific  problems  were 
addressed  in  this  project:  (1)  multiscale  simulation  from  atomistic  to  continuum  (or  vice  versa) 
via  mesoplasticity  by  coupling  molecular  dynamics  (MD)  with  the  material  point  method  (MPM) 
in  microtensile  testing  and  nanoindentation.  The  simulations  are  animated  so  that  the  process 
under  consideration  can  be  observed  in  situ  under  dynamic  conditions;  (2)  material  behavior  at 
nano-  and  microlevels  using  an  in  situ  microtensile  stage  on  an  atomic  force  microscope  (AFM) 
and  a  nanoindentation  system,  (3)  multiscale  simulation  using  the  generalized  interpolation 
material  point  (GIMP)  method  and  parallel  processing;  (4)  combined  finite  element  method 
(FEM)  and  nanoindentation  tests  to  determine  the  material  properties  of  single  crystal  copper  in 
different  orientations,  incorporating  mesoplastic  constitutive  laws,  (5)  development  of  accurate 
and  robust  potentials  from  ab  initio  calculations  using  a  commercial  (Gaussian  03)  software, 
novelty  sampling,  and  feed-forward  neural  networks  (NN);  and  (6)  Monte  Carlo-Steepcst 
Descent  simulations  of  nanometric  cutting. 

Keywords:  Scaling  laws,  multiscale  modeling,  nanoindentation,  material  point  method  (MPM), 
molecular  dynamics  (MD)  simulations,  neural  networks  (NN),  mesoplasticity, 
multilevel  refinement,  ab  initio  potentials,  Monte  Carlo  (MC)  Simulation 
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1.  INTRODUCTION 

Despite  significant  developments  in  materials  simulation  techniques,  the  goal  of  reliably 
predicting  the  properties  of  new  materials  in  advance  of  material  characterization  and  fabrication 
has  not  yet  been  achieved.  It  is  also  not  possible  to  predict  the  properties  of  the  material  at 
nanoscale  knowing  the  properties  of  the  material  at  macrolevel  or  vice  versa.  This  situation 
exists  for  several  reasons  that  include  a  lack  of  reliable  potentials  to  describe  the  behavior  of  the 
material,  computational  limitations,  inadequate  modeling  of  the  process,  absence  of  scaling  laws, 
and  difficulties  associated  with  the  experimental  measurement  of  properties  even  at  the 
microscale,  let  alone  at  nanoscale. 

Scaling  laws  governing  the  mechanical  behavior  of  materials  from  atomistic  (nano),  via 
mesoplastic  (micro),  to  continuum  (macro)  are  very  important  to  numerous  DoD  applications, 
such  as  the  development  of  a  new  class  of  aircraft  engine  materials,  or  new  steels  for  naval  battle 
ships,  or  new  tank  armor  materials  for  the  army,  or  numerous  microelectromechanical 
components  for  a  myriad  of  applications,  for  the  following  three  reasons:  (1)  information  on  the 
mechanical  behavior  of  materials  at  nanolevel  is  not^-presently  available  as  input  to 
nanotechnology  for  the  manufacturing  of  nanocomponents  or  microelectro-mechanical  systems 
(MEMS).  For  example,  nanostructures  may  possess  unique  properties  in  view  of  their  very  high 
surface  to  volume  ratio,  or  the  nanostructures  might  be  relatively  free  of  defects  with  strength 
approaching  the  theoretical  values,  (2)  applications  where  two  length  scales  of  different  orders  of 
magnitude  are  involved.  For  example,  one  is  atomistic  (nano)  and  the  other  mesoplastic  (micro) 
as  in  nanoindentation,  or,  one  is  mesoplastic  (micro)  and  the  other  continuum  (macro)  as  in 
conventional  indentation,  and  (3)  it  may  be  possible  to  extend  the  knowledge  accumulated  over 
time  on  material  behavior  at  the  macro  (or  continuum)  level  to  the  atomistic  (or  nano)  level,  via 
mesoplastic  (micro)  level.  In  this  investigation,  multiscale  modeling  and  simulations  of  material 
properties  are  addressed  considering  tensile  testing  and  indentation  as  two  examples.  These  two 
applications  are  chosen  because  of  the  extensive  use  of  these  techniques  for  determining  a  wide 
range  of  material  properties,  including  modulus,  yield  strength,  ultimate  strength,  strain  at 
fracture,  and  hardness.  It  may  be  noted  that  the  scaling  laws,  as  well  as  modeling  and  simulation 
tools  to  be  developed  in  this  investigation,  will  be  generic  and  can  be  applied  to  other  material 
processing  techniques  and/or  material  transformation  mechanisms  involved  in  these  processes. 
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Kalia  el  al.  [1]  outlined  the  following  benefits  of  nanophase  materials:  (1)  Unique 
physical  properties  resulting  from  small  grain  size  and  numerous  grains;  (2)  Unique  properties 
strongly  influenced  by  the  presence  of  a  large  number  of  atoms  at  the  interphasc  or  grain 
boundaries;  (3)  Ability  to  sinter  at  significantly  lower  temperatures  than  conventional  coarse¬ 
grained  solids  leading  to  considerable  savings  in  the  synthesis  of  high  temperature  materials;  (4) 
Remarkable  improvements  in  mechanical  properties,  such  as  hardness,  strength,  plastic 
deformation  as  the  materials  are  relatively  free  of  defects;  (5)  Enhanced  ductility,  especially  of 
some  inherently  brittle  ceramics,  due  to  grain-boundary  sliding.  Small  grains  also  facilitate 
efficient  impurity  doping  of  these  materials;  and  (6)  Unique  chemical  properties  due  to  high 
surface  to  volume  ratio  as  well  as  large  porosity  that  increases  the  adsorption  area  and  makes 
them  better  catalysts. 

Connecting  atomistic  mechanisms  of  deformation  and  fracture  to  macroscopic  behavior 
requires  an  appropriate  formalism  that  bridges  the  length  scales  of  these  processes  over  several 
orders  of  magnitude.  According  to  Hartley  [2],  the  essence  of  multiscale  modeling  of  the 
properties  of  materials  is  the  ability  to  develop  a  description  of  material  behavior  on  a 
macroscopic  scale  from  models  based  on  atomic  scale  mechanisms  or  vice  versa,  applying  to 
deformation  of  crystalline  materials,  this  takes  the  form  of  translating  the  behavior  of  individual 
dislocations  or  finite  groups  into  a  description  of  the  global  response  of  a  mass  of  material  to 
externally  applied  loads  and  displacements.  Any  such  model  is  subjected  to  restrictions  imposed 
by  the  geometry  of  the  crystal  lattice,  active  slip  systems,  the  orientations  and  shapes  of  grains 
within  the  material,  and  the  laws  of  thermodynamics. 

It  is  well  known  that  many  macroscopic  phenomena  have  their  origins  in  the  microscopic  or 
even  atomistic  phenomena,  For  example,  the  coalescence  of  microcracks  leading  to  gross 
fracture  has  microscopic  origin.  Similarly,  the  tip  of  a  microcrack  has  its  origin  at  the  atomistic 
level.  Another  example  where  a  macro  phenomenon  has  microscopic  origin,  which  in  turn  has 
atomistic  origin,  is  metal  cutting.  The  gross  deformation  in  the  primary  shear  zone  on  a 
macroscale  is  due  to  severe  plastic  deformation  (shear  strain  -2-6)  of  a  polycrystalline  material 
across  the  grain  boundaries.  But  at  the  microlevel,  the  discontinuous  nature  of  the  chip  formation 
leads  to  shear  lamella  (a  few  micrometers  wide)  followed  by  a  shear  front  (a  fraction  of  a 
micrometer  wide).  At  the  atomistic  level,  the  deformation  ahead  of  the  tool  depends  on  the 
structure  and  orientation  of  the  single  crystal  workmaterial  relative  to  the  tool  motion.  The  main 
objective  of  this  proposal  is  to  develop  a  fundamental  understanding  of  indentation  and  tension 
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at  various  scales  and  develop  a  computer  simulation  code  that  would  bridge  the  gap  from 
atomistic  to  continuum,  via.  mesoplastic  or  microscopic  behavior.  The  outcome  of  this  work  will 
provide  the  required  inputs  to  CAD  so  that  relevant  output  parameters  can  be  used  by  the 
engineer  in  the  design  of  components  at  any  level  of  detail,  namely  ,  large  macro  components,  or 
small  microcomponents,  or  even  extremely  small  nanocomponents. 


fm  pi  ....  log  I  in ic 

p  lofl  time  id  ** 

.  Figure  2  Schematic  of  various  spatial 

Figure  1  Schematic  of  simulations  at  and  temporal  scales  in  tribology 

various  temporal  and  spatial  levels 


Figure  1  is  a  schematic  of  the  various  computer  simulation  methods  used  in  the 
development  of  scaling  laws  as  a  function  of  spatial  and  temporal  variables.  At  the  subatomic 
level,  we  use  quantum  mechanics  or  ab  initio  calculations  to  develop  the  potential-energy 
functions.  With  increase  in  length  (or  time)  scale,  we  model  material  behavior  using  molecular 
dynamics  (MD)  and  Monte  Carlo  (MC)  simulations,  then  to  micro  or  mesoplastic,  and  finally  to 
continuum  mechanics  using  finite  element  methods  (FEM).  Figure  2  is  a  schematic  showing 
various  spatial  and  temporal  scales  for  a  typical  tribological  phenomenon  for  a  wide  range  of 
applications.  It  may  be  noted  at  the  outset  that  any  scaling  law  developed  would  involve  both 
temporal  and  spatial  scales.  But,  generally,  as  the  length  scale  increases,  so  does  the  time  scale. 
The  challenge  is  how  to  develop  methods  that  permit  simulation  of  a  process  with  a  fewer 
number  of  atoms  (for  e.g.  106  instead  of  10M  atoms  in  a  cube)  or  shorter  time  scale  (for  e.g,, 
microseconds  instead  of  seconds)  but  at  the  same  time  represent  the  process  at  the  meso  or 
macro  scale.  Alternatively  stated,  the  problem  is  to  develop  methods  that  link  a  process  at 
atomistic  level  to  its  neighborhood  at  the  microlevel,  and  from  there  to  the  continuum  level. 


} 


5 


What  are  the  rules  linking  these  three  levels?  An  attempt  is  made  in  this  project  to  address  these 
and  other  scientific  and  technological  issues. 

2.  OBJECTIVES 

The  objective  of  this  investigation  is  to  contribute  scientific  and  technological  issues 
towards  the  development  of  scaling  laws  from  atomistic  to  continuum  using  simulations  based 
on  such  material  testing  methods  as  microtension  and  nanoindentation.  Other  objectives,  in 
addition  to  the  technical  objective,  include  contribution  towards  the  development  of 
infrastructure  and  training  of  research  personnel  (or  human  resources)  in  disciplines  critical  to 
the  DoD  mission. 

Technical  Objectives: 

(1)  Multiscale  simulation  from  atomistic  to  continuum  (or  vice  versa)  via  mesoplasticity  by 
coupling  molecular  dynamics  (MD)  with  the  material  point  method  (MPM)  in 
microtensile  testing  and  nanoindentation.  The  simulations  are  animated  so  that  the 
process  under  consideration  can  be  observed  in  situ  under  dynamic  conditions; 

(2)  Material  behavior  at  nano-  and  microlevels  using  an  in  situ  microtensile  stage  on  an 
atomic  force  microscope  (AFM)  and  a  nanoindentation  system, 

(3)  Multiscale  simulation  using  the  generalized  interpolation  material  point  (GIMP)  method 
and  parallel  processing; 

(4)  Combined  finite  element  method  (FEM)  and  nanoindentation  tests  to  determine  the 
material  properties  of  single  crystal  copper  in  different  orientations,  incorporating 
mesoplastic  constitutive  iaws; 

(5)  Development  of  accurate  and  robust  potentials  from  ab  initio  calculations  using  a 
commercial  (Gaussian  03)  software,  novelty  sampling,  and  feed-forward  neural  networks 
(NN);  and 

(6)  Monte  Carlo-Steepcsl  Descent  simulations  of  nanometric  cutting. 


Infrastructure  Development: 

To  develop  computational  facilities  involving  massive  parallel  processing  capable  of 
modeling  and  simulation  of  materials  properties  and  design  as  well  as  state-of-the  art 
experimental  facilities  so  as  to  advance  OSU's  research  infrastructure  development  goals.  The 
Advanced  Computer  Simulation  Laboratory  as  well  as  the  Advanced  Materials  and  Processing 
Laboratory  will  be  a  showcase  for  OSU  in  particular  and  the  State  of  Oklahoma,  in  general. 

Training  of  Research  Personnel: 

To  contribute  towards  the  education  of  future  scientists  and  engineers  in  disciplines  critical 
to  the  DoD  mission.  Both  graduate  and  undergraduate  students  will  be  trained  to  address  the 
proposed  research  that  is  relevant  to  the  defense  mission  with  chemistry,  physics,  mechanical 
engineering,  and  materials  engineering  perspectives. 

In  the  following,  brief  summaries  of  various  problems  involved  in  multiscale  modeling  from 
atomistic  to  continuum  addressed  in  this  project  will  be  outlined.  They  were  covered  in  12 
technical  papers  published  in  archival  journals  and  included  in  this  report  for  completeness. 

3.  ACCOMPLISHMENTS 

3.1.  Multiscale  Simulation  from  Atomistic  to  Continuum  -  Coupling  Molecular  Dynamics 
(MD)  with  Material  Point  Method  (MPM)  (Lu  et  a!.  [3]) 

In  this  investigation,  we  introduced  a  new  multiscale  simulation  approach  that  coupled 
atomistic-scale  simulations  using  molecular  dynamics  (MD)  with  continuum-scale  simulations 
using  the  material  point  method  (MPM)  originally  developed  by  Sulsky,  Zhou,  and  Shreyer  [4] 
in  1995.  In  MPM,  material  continuum  is  represented  by  a  finite  collection  of  material  points 
carrying  all  relevant  physical  characteristics,  such  as  mass,  acceleration,  velocity,  strain  and 
stress.  The  use  of  material  points  at  the  continuum  level  provides  a  natural  connection  with  the 
atoms  in  the  lattice  at  the  atomistic  scale.  A  hierarchical  mesh  refinement  technique  in  MPM  is 
developed  to  scale  down  the  continuum  level  to  the  atomistic  level,  so  that  material  points  at  the 
fine  level  in  MPM  are  allowed  to  directly  couple  with  the  atoms  in  MD.  A  one-to-one 
correspondence  of  MD  atoms  and  MPM  points  is  used  in  the  transition  region  and  non-local 
elastic  theory  is  used  to  assure  compatibility  between  MD  and  MPM  regions,  so  that  seamless 
coupling  between  MD  and  MPM  can  be  accomplished. 
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A  single  crystal  silicon,  subjected  to  uniaxial  tension,  is  used  to  demonstrate  the  viability  of 
the  technique,  A  Tersoff-type,  three-body  potential  was  used  in  the  MD  simulations,  The 
coupled  MD/MPM  simulations  showed  that  silicon  under  nanometric  tension,  experiences  with 
increasing  elongation  in  elasticity,  dislocation  generation  and  plasticity  by  slip,  void  formation 
and  propagation,  formation  of  amorphous  structure,  necking,  and  final  rupture.  Results  are 
presented  in  terms  of  stress-strain  relationships  at  several  strain  rates,  as  well  as  the  rate 
dependence  of  uniaxial  material  properties.  This  new  multiscale  computational  method  has 
potential  for  use  in  cases  where  a  detailed  atomistic-level  analysis  is  necessary  in  localized 
spatially  separated  regions  whereas  continuum  mechanics  is  adequate  in  the  rest  of  the  material. 

3.2.  Two-Dimensional  Mixed  Mode  Crack  Simulation  Using  the  Material  Point  Method  (Wang 
et  al,  [5]) 

The  material  point  method  (MPM)  was  used  by  Sulsky,  Zhou,  and  Shreyer  [4]  to 
demonstrate  its  capabilities  in  the  simulation  of  impact/contact/penetration  and  interfacial  crack 
growth  problems.  Because  of  the  use  of  material  points  in  the  description  of  a  continuum, 
consistent  with  the  particle  description  (atoms)  using  molecular  dynamics  (MD),  it  is  natural  to 
couple  MPM  with  MD  for  simulation  from  atomistic  to  continuum  levels.  However,  in 
addressing  plane  stress/plane  strain  problems,  the  MPM  algorithm  and  simulation  examples 
available  in  literature  use  a  regular  grid  mesh  with  uniform  square  cells  and  enforce  velocity  and 
displacement  continuities  through  its  background  grid,  resulting  in  limitations  in  dealing  with 
stress  concentration,  inclined  dislocations  and  inclined  crack,  etc. 

In  this  investigation,  we  implemented  an  irregular  mesh  in  MPM  to  eliminate  the  [imitations 
resulting  from  (he  use  of  a  regular  mesh.  The  ray -crossing  algorithm  was  employed  to  determine 
which  cell  a  material  point  belongs  to  after  deformation  for  interpolation  and  extrapolation  of 
variables  between  material  points  and  grid  nodes.  As  an  example  to  demonstrate  the  capability  of 
the  MPM  using  irregular  mesh,  the  stress  field  in  a  continuum  with  an  inclined  crack  is 
determined  using  arbitrary  quadrilateral  cells  in  the  background  grid  mesh.  The  use  of  irregular 
mesh  in  MPM  was  verified  by  comparing  MPM  results  with  ABAQUS/Explicit  simulation.  The 
proposed  method  of  using  irregular  mesh  will  be  an  essential  element  in  using  MPM  to  couple 
with  atomistic  scale  simulation  so  that  MPM  can  address  inclined  dislocations  and  cracks 
emanating  from  the  atomistic  simulation. 
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3.3,  Multiscale  Simulations  Using  Generalized  Interpolation  Material  Point  (GIMP)  Method 
and  SAMRAI  Parallel  Processing  (Ma  et  al.,  [6]) 

In  the  simulation  of  a  wide  range  of  mechanics  problems  including  impact/contact/ 
penetration  and  fracture,  the  material  point  method  (MPM)  has  demonstrated  its  computational 
capabilities.  To  resolve  alternating  stress  sign  and  instability  problems  associated  with 
conventional  MPM,  Bardenhagen  and  Kober  [7]  in  2004  introduced  the  generalized  interpolation 
material  point  (GIMP)  method  and  implemented  for  one-dimensional  simulations. 

In  this  investigation,  we  have  extended  GIMP  to  2D  and  applied  to  simulate  simple  tension 
and  indentation  problems.  For  simulations  spanning  multiple  length  scales,  based  on  the 
continuum  mechanics  approach,  we  developed  a  parallel  GIMP  computational  method  using  the 
Structured  Adaptive  Mesh  Refinement  Application  Infrastructure  (SAMRAI).  SAMRAI  is  used 
for  multi-processor  distributed  memory  computations,  as  a  platform  for  domain  decomposition, 
and  for  multi-level  refinement  of  the  computational  domain.  Nested  computational  grid  levels 
(with  successive  spatial  and  temporal  refinements)  are  used  in  GIMP  simulations  to  improve  the 
computational  accuracy  and  to  reduce  the  overall  computational  time.  The  domain  of  each  grid 
level  is  divided  into  multiple  rectangular  patches  for  parallel  processing.  This  domain 
decomposition  embedded  in  SAMRAI  is  very  flexible  when^pplied  to  GIMP.  As  an  example  to 
verify  the  parallel  GIMP  computing  scheme  under  SAMRAI  parallel  computing  environment, 
numerical  simulations  with  multiple  length  scales  from  nanometer  to  millimeter  were  conducted 
on  a  2D  nanoindentation  problem. 

A  contact  algorithm  in  GIMP  has  also  been  developed  for  the  treatment  of  contact  pair 
between  a  rigid  indenter  and  a  deformable  workpiece.  GIMP  results  are  compared  with  finite 
element  results  on  indentation  for  validation.  A  GIMP  nanoindentation  problem  with  five  levels 
of  refinement  was  modeled  using  multi -processors  to  demonstrate  the  potential  capability  of  the 
parallel  GIMP  computation. 

3.4.  Structured  Mesh  Refinement  in  Generalized  Interpolation  Material  Point  (GIMP)  Method 
for  Simulation  of  Dynamic  Problems  (Ma  et  al.t  [8]) 

The  generalized  interpolation  material  point  (GIMP)  method,  recently  developed  using  a 
C1  continuous  weighting  function,  has  solved  the  numerical  noise  problem  associated  with 
material  points  just  crossing  the  cell  borders,  so  that  it  is  suitable  for  simulation  of  relatively 
large  deformation  problems.  However,  this  method  typically  uses  a  uniform  mesh  in  computation 
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when  one  level  of  material  points  is  used,  thus  limiting  its  effectiveness  in  dealing  with 
structures  involving  areas  of  high  stress  gradients. 

In  this  investigation,  a  spatial  refinement  scheme  of  the  structured  grid  for  GIMP  was  used 
for  simulations  with  highly  localized  stress  gradients.  A  uniform  structured  background  grid  is 
used  in  each  refinement  zone  for  interpolation  in  GIMP  for  ease  of  generating  and  duplicating 
structured  grid  in  parallel  processing.  The  concept  of  influence  zone  for  the  background  node 
and  transitional  node  is  introduced  for  the  mesh  size  transition.  The  grid  shape  function  for  the 
transitional  node  is  modified  accordingly,  whereas  the  computation  of  the  weighting  function  in 
GIMP  remains  the  same.  Two  other  issues  are  also  addressed  to  improve  the  GIMP  method.  The 
displacement  boundary  conditions  are  introduced  into  the  discretization  of  the  momentum 
conservation  equation  in  GIMP,  and  a  method  is  implemented  to  track  the  deformation  of  the 
material  particles  by  tracking  the  position  of  the  particle  comers  to  resolve  the  problem  of 
artificial  separation  of  material  particles  in  GIMP  simulations.  Numerical  simulations  of  several 
problems,  such  as  tension,  indentation,  stress  concentration  and  stress  distribution  near  a  crack 
(mode  I  crack  problem)  are  presented  to  verify  this  refinement  scheme. 

3.5.  Multiscale  Simulation  Using  Generalized  Interpolation  Material  Point  (GIMP)  Method  and 

Molecular  Dynamics  (MD)  (Ma  et  a [9]) 

A  new  method  for  multiscale  simulation  bridging  two  scales,  namely  the  continuum  scale 
using  the  generalized  interpolation  material  point  (GIMP)  method  and  the  atomistic  scale  using 
the  molecular  dynamics  (MD),  is  presented  and  verified  in  2D.  The  atomistic  strain  from  the 
molecular  dynamics  simulation  is  determined  through  interpolation  of  displacement  field  into 
Eulerian  background  grid  using  the  same  generalized  interpolation  functions  as  in  GIMP 
method.  The  atomistic  strain  as  defined  is  consistent  with  that  as  determined  from  virial  theorem 
for  interior  points,  but  provide  more  reasonably  stable  values  at  the  boundary  of  MD  region  or  in 
the  handshaking  region  between  MD-GIMP  so  that  physical  quantities,  such  as  the  displacement, 
interna!  forces,  and  energy  density  are  compatible  in  the  handshaking  region.  A  continuum 
materia!  point  in  the  continuum  is  split  into  smaller  material  points  using  multi-level  refinement 
until  it  has  nearly  reached  the  atom  size  to  couple  with  atoms  in  the  MD  region.  Consequently, 
coupling  between  GIMP  and  MD  is  achieved  by  using  compatible  deformation  and  force  fields 
in  the  transition  region  between  GIMP  and  MD. 
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The  coupling  algorithm  is  implemented  in  the  Structural  Adaptive  Mesh  Refinement 
Application  Infrastructure  (SAMRAI)  for  parallel  processing.  Both  mode  I  and  mode  II  crack 
propagation  problems  are  simulated  using  the  coupling  algorithm.  The  stress  field  near  the  crack 
tip  was  validated  by  comparing  the  results  from  coupled  simulation  with  purely  GIMP 
simulations  of  the  same  model.  Coupled  simulation  results  were  also  compared  with  pure  MD 
simulation  results.  In  both  cases,  a  very  good  agreement  was  obtained. 

3.6.  Combined  Numerical  Simulation  and  Nanoindentation  for  Determining  Mechanical 

Properties  of  Single  Crystal  Copper  at  Mesoscale  ( Liu  et  a/., [10]) 

Constitutive  laws  are  critical  in  the  investigation  of  mechanical  behavior  of  single  crystal 
or  polycrystalline  materials  in  applications  spanning  from  microscale  to  macroscale.  In  this 
investigation,  a  combined  FEM  simulation  and  experimental  nanoindentation  approach  was 
taken  to  determine  the  mechanical  behavior  of  single  crystal  copper  incorporating  the 
mesoplastic  constitutive  model.  This  model  was  implemented  in  a  user-defined  subroutine  in  3D 
ABAQUS/Explicit  code. 

Nanoindentation  was  modeled  using  the  multiscale  modeling  technique  involving 
mesoplasticity  and  elasticity,  i.e.,  mesoplastic  constitutive  model  was  used  near  the  local 
nanoindentation  region  (where  the  dislocations  are  generated)  while  elastic  constitutive  model 
was  used  in  rest  of  the  region  in  the  workmaterial.  The  meso-mechanical  behavior  of  the 
crystalline  structure  and  the  effect  of  the  mesoplastic  parameters  on  the  nanoindentation  load- 
displacement  relationships  were  investigated  in  the  FEM  analysis. 

Nanoindentation  tests  were  conducted  on  single  crystal  copper  to  determine  load- 
displacement  relationships.  Appropriate  mesoplastic  parameters  were  determined  by  fitting  the 
simulated  load-displacement  curves  to  the  experimental  data.  The  mesoplastic  model,  with 
appropriate  parameters,  was  then  used  to  determine  the  stress-strain  relationship  of  a  single 
crystal  copper  at  meso-scale.  The  effect  of  indenter  radius  (3.4-1000  mm)  on  material  hardness 
under  nanoindentation  was  simulated  and  found  to  match  the  experimental  data  for  several 
indenter  radii  (3.4,  10  and  500  mm).  A  comparison  of  the  topographies  of  nanoindentation 
impressions  in  the  experiments  with  FEM  results  showed  a  reasonably  good  agreement, 

3.7,  Monte  Carlo  Simulation  of  Nanometric  Cutting  (Komanduri  et  al,  [11]) 

Nanometric  cutting  of  single-crystal  materials  at  conventional  cutting  speeds  (5  ms  ')  is 
simulated  for  the  first  time  using  a  new  Monte  Carlo  method  that  is  applicable  to  systems  that 
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are  neither  canonical  nor  microcanonica!.  This  is  accomplished  by  defining  a  local  temperature 
in  the  cutting  zone  using  the  thermal  analysis  developed  by  Komanduri  and  Hou  [12]  for 
conventional  machining.  Extension  of  this  method  to  the  nanometric  regime  permits  an  accurate 
estimate  of  the  local  temperature  in  cutting,  This  temperature  is  then  employed  in  the  Boltzmann 
probability  distribution  function  that  is  used  to  determine  the  acceptance-rejection  of  Monte 
Carlo  moves  in  the  simulation.  Since  cutting  speed  is  closely  related  to  cutting  temperature,  the 
cutting  speed  enters  the  calculation  via  the  thermal  analysis  equations.  The  method  is  applied  to 
nanometric  cutting  of  single-crystal  aluminum  with  the  crystal  oriented  in  the  (001)  plane  and 
cut  in  the  [100]  direction.  Three  positive  rake  cutting  tools,  namely,  10°,  30°,  and  45°  are 
employed  to  investigate  the  effect  of  rake  angle  on  the  forces,  the  specific  energy,  and  the  nature 
of  the  chip  formation.  The  method  is  evaluated  by  direct  comparison  with  corresponding 
molecular  dynamics  simulations  conducted  under  the  same  conditions. 

3.8.  Monte  Carlo-Steepest  Descent  (MC-SD)  Simulations  of  Nanometric  Cutting  (Narulkar  el 

at.,  [13]) 

In  order  to  reduce  the  computational  time,  Monte  Carlo  (MC)  simulations  of  nanometric 
cutting  have  been  modified  to  include  a  combination  of  steepest  descent  (SD)  and  Monte  Carlo 
procedures.  This  MC-SD  combination  is  found  to  reduce  the  required  computational  times  by  a 
factor  of  at  least  two  to  three  over  those  achieved  using  MC  methods  alone.  The  MC-SD  method 
is  applied  to  the  nanometric  cutting  of  single-crystal  aluminum  along  the  (100)  plane  with 
different  rake  angle  tools  at  a  cutting  speed  of  5  m/s.  The  results  obtained  from  the  MC-SD 
calculations  are  found  to  be  almost  identical  to  those  resulting  from  the  MC  simulations. 

3.9.  Ah  initio  Potential-Energy  Surfaces  for  Complex,  Multi-channel  Systems  Using  Modified 

Novelty  Sampling  and  Feedforward  Neural  Networks  (Raff  et  at.,  [14]) 

A  neural  network/trajectory  approach  is  presented  for  the  development  of  accurate 
potential-energy  hypersurfaces  that  can  be  utilized  to  conduct  ah  initio  molecular  dynamics 
(AIMD)  and  Monte  Carlo  studies  of  gas-phase  chemical  reactions,  nanometric  cutting,  and 
nanotribology,  and  of  a  variety  of  mechanical  properties  of  importance  in  potential 
microelectromechanical  systems  applications.  The  method  is  sufficiently  robust  such  that  it  can 
be  applied  to  a  wide  range  of  polyatomic  systems.  The  overall  method  integrates  ab  initio 
electronic  structure  calculations  with  importance  sampling  techniques  that  permit  the  critical 
regions  of  configuration  space  to  be  determined.  The  computed  ab  initio  energies  and  gradients 
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are  then  accurately  interpolated  using  neural  networks  (NN)  rather  than  arbitrary  parametrized 
analytical  functional  forms,  moving  interpolation  or  least-squares  methods.  The  sampling 
method  involves  a  tight  integration  of  molecular  dynamics  calculations  with  neural  networks  that 
employ  early  stopping  and  regularization  procedures  to  improve  network  performance  and  test 
for  convergence.  The  procedure  can  be  initiated  using  an  empirical  potential  surface  or  direct 
dynamics. 

The  accuracy  and  interpolation  power  of  the  method  has  been  tested  for  two  cases,  the 
global  potential  surface  for  vinyl  bromide  undergoing  unimolecular  decomposition  via  four 
different  reaction  channels  and  nanometric  cutting  of  silicon.  The  results  show  that  the  sampling 
methods  permit  the  important  regions  of  configuration  space  to  be  easily  and  rapidly  identified, 
that  convergence  of  the  NN  fit  to  the  ah  initio  electronic  structure  database  can  be  easily 
monitored,  and  that  the  interpolation  accuracy  of  the  NN  fits  is  excellent,  even  for  systems 
involving  five  atoms  or  more.  The  method  permits  a  substantial  computational  speed  and 
accuracy  advantage  over  existing  methods,  is  robust,  and  relatively  easy  to  implement. 

3. 10.  Prediction  of  Molecular-Dynamics  Simulation  Results  Using  Feedforward  Neural 

Networks:  Reaction  of  Ci  Dimer  with  an  Activated  Diamond  (100)  Surface  (Agrawal  et  aL 
[15]) 

A  new  approach  involving  neural  networks  (NN)  combined  with  molecular  dynamics 
(MD)  has  been  used  for  the  determination  of  reaction  probabilities  as  a  function  of  various  input 
parameters  for  the  reactions  associated  with  the  chemical  vapor  deposition  (CVD)  of  carbon 
dimers  on  a  diamond  (100)  surface.  The  data  generated  by  the  simulations  have  been  used  to 
train  and  test  neural  networks.  The  probabilities  of  chemisorption,  scattering,  and  desorption  as  a 
function  of  input  parameters,  such  as  rotational  energy,  translational  energy,  and  direction  of  the 
incident  velocity  vector  of  the  carbon  dimer,  have  been  considered.  The  very  good  agreement 
obtained  between  the  predictions  of  NN  and  those  provided  by  MD  and  the  fact  that,  after 
training  the  network,  the  determination  of  the  interpolated  probabilities  as  a  function  of  various 
input  parameters  involves  only  the  evaluation  of  simple  analytical  expressions  rather  than 
computationally  intensive  algorithms  show  that  neural  networks  are  extremely  powerful  tools  for 
interpolating  the  probabilities  and  rates  of  chemical  reactions.  We  also  find  that  a  NN  fits  the 
underlying  trends  in  the  data  rather  than  the  statistical  variations  present  in  the  MD  results. 
Consequently,  NN  can  also  provide  a  computationally  convenient  means  of  averaging  the 


13 


statistical  variations  inherent  in  MD  calculations.  In  the  present  case  the  application  of  this 
method  is  found  to  reduce  the  statistical  uncertainty  in  the  MD  results  by  a  factor  of -3.5. 

3.11.  Monte  Carlo  Simulation  of  Void-nucleated  Melting  of  Silicon  via  Modification  in  the 

Tersoff  Potential  Parameters  (Agrawal  et  ai,  [16]) 

Molecular  dynamics  (MD)  simulations  of  silicon  melting  reported  in  the  literature  using 
Tersoff  s  potential  give  melting  temperatures  about  50%  higher  than  the  experimental  value.  To 
address  this  discrepancy,  we  proposed  a  modification  to  the  values  of  the  parameters  of  the 
Tersoff  potential.  This  modification  involves  a  change  in  the  magnitude  of  3  of  the  12 
parameters  in  the  Tersoff  potential  as  an  effective  means  for  bringing  down  the  melting  point 
close  to  the  experimental  value,  The  melting  point  is  determined  by  performing  Monte  Carlo 
(MC)  simulations  using  the  empirical  void-nucleated  melting  procedure.  In  addition  to  an 
agreement  of  the  computed  melting  point  with  the  experiment,  the  modified  parameters  also 
bring  the  density  of  the  liquid  and  the  solid  at  the  melting  point  into  good  agreement  with  the 
experiment  without  significantly  altering  the  density  and  energy  values  of  the  solid  crystal  at 
room  temperature.  The  coordination  number  and  specific  heat  of  the  liquid  are  also  found  to  be 
in  better  agreement  with  the  experiment  when  the  modified  set  of  parameters  is  used.  A 
comparison  of  density  over  a  wide  range  of  temperatures $hows  that  the  density  of  the  solid 
predicted  by  the  Tersoff  potential  with  the  modified  parameters  is  larger  than  that  given  by  the 
unmodified  parameters.  This  difference,  however,  is  not  appreciable  at  room  temperature;  it 
increases  with  temperature  and  is  about  1%  at  the  melting  point.  The  computed  cohesive  energy 
of  the  solid,  the  point  defect  energy  corresponding  to  a  vacancy,  and  the  surface  energy  values 
for  (100)  and  (1 1 1)  surfaces  are  also  found  to  be  nearly  the  same  for  the  modified  as  well  as  the 
unmodified  parameters. 

3. 12.  Prediction  of  Molecular-Dynamics  Simulation  Results  Using  Feedforward  Neural 

Networks:  Reaction  of  C2  Dimer  with  an  Activated  Diamond  (100)  Surface  (Agrawal  et  al 

[i7].; 

The  neural  network  (NN)  procedure  to  interpolate  ah  initio  data  for  the  purpose  of 
molecular  dynamics  (MD)  simulations  has  been  tested  on  the  SiO:  system,  Unlike  other  similar 
NN  studies,  here,  we  studied  the  dissociation  of  SiC^  without  the  initial  use  of  any  empirical 
potential.  During  the  dissociation  of  S1O2  into  Si+O  or  Si+O;,  the  spin  multiplicity  of  the  system 
changes  from  a  singlet  to  a  triplet  in  the  first  reaction  and  from  a  singlet  to  a  pentet  in  the  second. 
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In  this  inverstigation  we  employed  four  potential  surfaces.  The  first  is  a  NN  fit  [NN(STP)]  to  a 
database  comprising  the  lowest  of  the  singlet,  triplet,  and  pentet  energies  obtained  from  density 
functional  calculations  in  6673  nuclear  configurations.  The  other  three  potential  surfaces  are 
obtained  from  NN  fits  to  the  singlet,  triplet,  and  pentet-state  energies.  The  dissociation  dynamics 
on  the  singlet-state  and  NN(STP)  surfaces  are  reported.  The  results  obtained  using  the  singlet 
surface  correspond  to  those  expected  if  the  reaction  were  to  occur  adiabatically.  The  dynamics 
on  the  NN(STP)  surface  represent  those  expected  if  the  reaction  follows  a  minimum-energy 
pathway.  This  study  on  a  small  system  demonstrates  the  application  of  NNs  for  MD  studies 
using  ab  initio  data  when  the  spin  multiplicity  of  the  system  changes  during  the  dissociation 
process. 

3.13.  A  New  Method  for  Estimating  local  Strains  using  Nanoindentation  and  Atomic  Force 

Microscopy  (Varghese  [18]) 

A  new  method  was  developed  for  determining  strains  in  very  small  regions  using  a  micro- 
tensile  stage  and  nanoindentation  as  tools.  Indents  formed  by  nanoindentation  are  used  as  probes 
to  measure  local  deformations  in  a  small  region  that  can  hardly  be  measured  with  most  existing 
techniques-  Strains  were  introduced  in  samples  via  a  small  in-situ  tensile  testing  system, 
originally  developed  for  in-situ  testing  inside  an  SEM,  and  modified  and  adapted  here  for  use 
with  the  AFM,  Variation  of  local  strains  in  very  small  regions  compared  to  global  strains  has 
also  been  established.  Two  approaches  were  used  to  estimate  strains.  In  the  first  approach,  length 
changes  of  a  triangular  indent  are  used  to  determine  normal  strains  and  the  corresponding  normal 
and  shear  strain  components.  In  the  second  approach,  displacement  changes  in  any  pair  of  three 
indents  are  monitored  for  computation  of  normal  strains  and  the  associated  normal  and  shear 
strain  components  using  strain  transformation  relations.  Indents  were  made  on  a  polycrystalline 
copper  tensile  specimen,  and  the  residual  indent  geometry  is  monitored  using  an  atomic  force 
microscope  (AFM). 

The  results  show  that  the  shape  of  the  residual  indents  depends  on  the  crystal  orientation 
and  the  angle  the  slip  bands  make  with  individual  grains.  The  technique  developed  is 
complementary  to  existing  strain  measurement  techniques,  and  is  especially  suitable  at  micro- 
and  nano-scales.  The  techniques  are  validated  at  two  scales,  namely,  microscale  using 
microindents  formed  by  a  Nanoindentation  system,  and  nanoscale  using  nanoindents  made  by 
AFM  tips.  Strains  were  compared  by  measurement  of  strains  by  this  technique  and  the 
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deformations  determined  using  digital  image  correlation  (DIC)  under  relatively  small 
deformations.  Further  validation  of  this  technique  is  made  by  performing  a  virtual  “digital” 
tensile  test  consisting  of  simultaneous  digital  stretching  and  shrinking  of  an  image  of  the 
nanoindents  by  known  values  and  analyzing  the  strains.  Reasonably  good  agreement  has  been 
obtained  at  both  scales.  The  validated  technique  is  then  used  to  measure  relative  deformations  of 
multiple  grains  and  deformations  within  a  single  grain  in  the  polycrystalline  copper.  Strains 
associated  with  a  slip  band  were  determined  by  this  technique 
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A  new  mul [1  scale  simulation  approach  is  introduced  that  couples  atomistic-scale 
simulations  using  molecular  dynamics  |MD)  with  continuum-scale  simulations 
using  the  recently  developed  material  point  method  (MPM),  In  MPM,  material 
continuum  is  represented  by  a  finite  collection  of  material  points  carrying  a  El 
relevant  physical  characteristics,  such  as  mass,  acceleration,  velocity*  strain  and 
stress.  The  use  of  material  points  at  the  continuum  level  provides  a  natural 
connection  with  the  atoms  in  the  lattice  at  the  atomistic  scale,  A  hierarchical  mesh 
refinement  technique  in  MPM  is  presented  to  scale  down  the  continuum  level  to 
the  atomistic  level,  so  that  material  points  at  the  fine  level  in  MPM  arc  allowed 
to  directly  couple  with  the  atoms  in  MD,  A  one-lo-onc  correspondence  of  MD 
atoms  and  MPM  points  is  used  in  the  transition  region  and  non-local  elastic 
theory  is  used  to  assure  compatibility  between  MD  and  MPM  regions,  so  that 
seamless  coupling  between  MD  and  MPM  can  be  accomplished.  A  silicon  single 
crystal  under  uniaxial  tension  is  used  in  demonstrating  the  viability  of  the 
technique.  A  TersofTtype*  three-body  potential  was  used  in  the  MD  simulations. 
The  coupled  MD  MPM  simulations  show  that  silicon  under  nanomctric  tension 
experiences*  with  increasing  elongation  in  elasticity*  dislocation  generation  and 
plasticity  by  slip*  void  formation  and  propagation*  formation  of  amorphous 
structure,  necking,  and  final  rupture.  Results  are  presented  in  terms  of 
stress  strain  relationships  uL  several  strain  rates*  as  well  as  the  rate  dependence 
of  uniaxial  material  properties.  This  new  multiscale  computational  method  has 
potential  for  use  in  cases  where  a  detailed  atomistic-level  analysis  is  necessary 
in  localized  spatially  separated  regions  whereas  continuum  mechanics  is  adequate 
in  the  rest  of  the  material 


1.  Introduction 

Material  failure  at  all  length  scales  experiences  such  processes  as  clastic  deformation* 
dislocation  generation  and  propagation,  cleavage*  crack  initiation  and  growth,  and 
final  rupture.  In  recent  years,  combined  atomistic  and  continuum  simulations  have 
received  increasing  attention  due  to  their  potential  linkage  between  structure 
property  relationships  from  nano  to  macro  levels  [l  II],  Various  computational 
methods  have  been  developed  to  couple  spatial  and  temporal  scales*  Figure  la  is  a 
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Figure  i.  (a)  Schematic  of  simulations  at  various  levels,  (b)  Schematic  of  various  length  and 
time  scales  in  tribology. 


schematic  of  the  various  computer  simulation  methods  used  in  the  development  of 
scaling  laws  as  a  function  of  spatial  and  temporal  variables  [12].  At  the  subatomic 
level t  vve  use  quantum  mechanics  or  ab  initio  calculations  to  develop  potential  energy 
functions.  With  an  increase  in  length  (or  time)  scale,  we  model  material  behaviour 
using  molecular  dynamics  (MD)  and  Monte  Curio  (MC)  simulations,  then  to  micro 
or  mcsoplustie,  and  finally  to  continuum  mechanics,  such  as  finite  element  method 
(FEM).  Figure  lb  is  a  schematic  showing  various  spatial  and  temporal  scales  for  a 
typical  tribological  phenomenon  for  a  wide  range  of  applications  [13].  It  may  be 
noted  at  the  outset  that  any  scaling  law  developed  would  involve  both  temporal  and 
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spatial  scales.  At  the  atomistic  level,  MD  is  the  preferred  mode  while  at  the 
continuum  level,  FEM  is  the  dominant  numerical  technique  used.  Consequently, 
most  published  results  on  combined  atomistic  and  continuum  simulations  have 
focused  on  MD/FEM  coupling. 


2,  Brief  review  of  literature 

In  combined  atomistic  and  continuum  simulations,  material  is  divided  into  atomistic 
and  continuum  regions.  Seamless  coupling  is  required  for  the  combined  MD/FEM 
simulations,  which  is  a  key  issue  in  multiscale  simulations.  Using  a  combined  finite- 
clement  and  atomistic  model,  KohlhofT  et  aL  [14]  introduced  the  transition  region 
between  atomic  lattice  and  continuum.  An  early  suggestion  for  coupling  the 
continuum  and  the  atomistic  regimes  was  due  to  Hoover  et  aL  [15],  and  more 
recently  due  to  Curtin  and  Miller  {16]  and  Shiari  et  aL  [17],  Ghonicm  et  a /,  [18],  Liu 
et  aL  [  19]  and  Lu  and  Kaxiras  [20]  have  reviewed  in  detail  various  coupling  methods 
linking  atomistic  to  continuum  simulations. 

The  hybrid  MD/FEM  method  introduces  an  MD/FEM  handshake  region  for  the 
interface  between  the  MD  and  the  FEM  regions,  where  MD  atoms  and  FEM  nodes 
are  coupled  by  mutual  displacement  boundary  conditions.  The  kinetic  and  strain 
(potential)  energy  is  defined  for  the  entire  system  including  the  handshaking 
interactions  at  the  interface.  The  FEM  mesh  in  the  MD/FEM  handshaking  region 
is  scaled  down  to  coincide  with  the  atomic  lattice.  A  one-to-one  correspondence 
of  MD  atoms  and  FEM  nodes  is  required  in  the  transition  region.  The  FEM  mesh 
is  gradually  made  coarser  in  areas  far  away  from  the  interested  atomistic  region. 
Additionally,  the  width  of  the  transition  region  is  taken  to  be  equal  to  the  cut-off 
distance  of  the  interaction  potential  used  in  the  MD  region.  This  provides  a  complete 
set  of  neighbours  within  the  interaction  range  for  all  atoms  in  the  MD  region. 
Atoms/FEM  nodes  that  belong  to  the  transition  region  not  only  interact  via  the 
interaction  potential  with  the  MD  region  but  are  also  part  of  the  nodes  in  the  FEM 
region.  The  positions  and  velocities  of  these  atoms/FEM  nodes  in  the  transition 
region  (both  on  the  MD  side  and  on  the  FEM  side)  must  be  consistent  with  each 
other  The  forces  exerted  on  the  atoms  in  the  transition  region,  due  to  the  interaction 
with  the  MD  region,  make  up  the  external  forces  on  the  FEM  nodes  for  the  FEM 
simulation.  However,  under  large  deformations,  the  FEM  elements  become  distorted 
and  it  is  difficult  to  implement  the  entire  computational  process  in  the  FEM 
simulations. 

Recently,  a  new  computational  method,  namely,  the  material  point  method 
(MPM)  was  developed  by  Sulsky  and  others  [21,  22]  at  the  University  of  New 
Mexico  with  support  from  the  Sandia  National  Laboratories,  Compared  to  FEM, 
MPM  has  the  following  advantages:  (1)  MPM  is  able  to  handle  large  deformation  in 
a  more  natural  manner  so  that  mesh  lock-up  present  in  FEM  is  avoided;  (2)  MPM 
can  easily  couple  with  molecular  dynamics  (MD)  simulations  because  of  the  use 
of  material  points  (similar  to  atoms  used  in  MD)  instead  of  elements  in  FEM: 
(3)  parallel  computation  is  more  straightforward  in  MPM  because  of  the  use  of 
a  grid  structure  that  is  consistent  with  parallel  computing  grids:  and  (4)  use  of  the 
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background  grid  in  MPM  enables  structured  adaptive  refinement  for  local  interested 
region.  As  a  result.  MPM  is  a  logical  choice  for  coupling  MD  to  continuum 
approach  for  multisealc  simulations.  However,  the  conventional  MPM  discretizes 
the  material  using  a  regular  grid  with  uni  form-square  (two-dimensional)  or  cubic 
(three-dimensional)  cells.  Such  a  mesh  is  inefficient  in  dealing  with  stress  concen¬ 
tration  issues  and  cannot  be  refined  to  scale  down  the  size  to  different  levels  [23]. 
Consequently,  a  hierarchical  mesh  refinement  technique  in  MPM  simulations  has 
been  developed  in  this  investigation. 

In  this  paper,  a  seamless  M  D/M  PM  coupling  approach  is  presented  in  which 
MD  modelling  is  used  in  the  local  area  of  interest  where  the  variation  is  at  an 
atomistic  scale  and  MPM  simulation  in  the  rest  of  the  material.  A  multisealc 
simulation  of  uniaxial  tension  is  carried  out  on  a  silicon  specimen  at  different  strain 
rates  to  illustrate  this  methodology.  A  hierarchical  mesh  refinement  technique  is 
introduced  in  MPM  to  scale  down  from  the  continuum  level  into  the  atomistic  level 
near  the  MD,  MPM  transition  region  where  the  MPM  points  overlap  the  MD  atoms. 
A  seamless  coupling  is  realized  for  complete  compatibility  of  both  strain  and  stress 
for  MD  and  MPM  regions.  In  tensile  simulations,  the  effects  of  strain  rate  on 
the  nature  of  deformation  and  material  properties  are  investigated  based  on  the 
simulated  stress  strain  relations  for  silicon. 


3.  Computational  methodology 

3.1.  Molecular  Dynamics 

6 

3.1.1.  Theoretical  background.  In  molecular  dynamics  (MD),  the  Newtonian 
second  law  for  the  time  evolution  of  a  system  is  expressed  by  [24,  25] 

mid2ti/dt2  =  f,,  (/=  I . -/V)  (I) 

where  N  is  the  total  number  of  atoms,  mt  is  the  mass  of  the  /th  atom  with  position 

vector  r„  f  =  -V,^(r(,  n* _ r v >  is  the  force  acting  on  the  /th  atom  due  to 

interaction  with  other  atoms  in  the  system,  and  F(r|,r? . r^)  is  the  interaction 

potential.  The  initial  positions  and  velocities  of  the  atoms,  together  with  the 
interaction  potential,  define  the  whole  set  of  thermodynamic,  clastic  and  mechanical 
properties  of  the  material. 

The  set  of  JN  second-order  differential  equations,  equation  (I),  is  often  solved 
by  recasting  it  as  a  set  of  6/V  first-order  Hamiltonian  equations  of  motion,  thus 

dpj/df  —  ft  =  -V,F(  . Tv), 

i/Xi/dt  =  p  if  mi 

where  p,  are  momenta.  Under  the  given  initial  positions  and  momenta  of  the  system, 
integration  of  equation  (2)  yields  the  total  trajectory  of  the  system.  With  the 
knowledge  of  the  trajectories  of  all  the  atoms,  one  can  calculate  spatial  and  temporal 
distributions  of  energy,  temperature  and  pressure,  as  well  as  monitor  the  structural 
and  phase  changes  in  the  system. 
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The  atomistic  potential  for  the  interaction  between  Si  Si  atoms  herein  is  a 
general  form  of  Tersoff-type  three-body  potential  [26,  271.  The  total  energy  as  a 
function  of  atomic  coordinates  is  given  by 


where  /  and  /  label  the  atoms  of  the  system  and  W„  is  the  bond  energy  of  all  the 
atomic  bonds  in  the  control  volume  represented  by 

+  WMl  <4> 

where  rH  is  the  interatomic  distance.  The  jR  and  fA  functions  are  the  repulsive  and 
attractive  forces  between  atoms  /  and  j*  respectively.  These  functions  are  expressed  by 

=  AjjCxp(-k0nj)%  (5) 


and 

1a  ( ru )  =  -  Bij  exp  ( -iLijr#)  .  (6 ) 


The f  (rif)  function  acts  as  a  cut-olT  function  that  smoothly  attenuates  the  interaction 
as  r,/  approaches  the  cutoff  radius  Sa. 


1  1 

for  ry  <  Ry 

.Mrv)  —  ■ 

1 

2 

1  ,  (nj  -  Ry 
+  2V'0i,\Sij  -  R,, 

for  R„  <  r„  <  Sy 

(7) 

0 

for  r,j  >  Sy 

The  h„  function  is  given  by 

/ 

v-t  1/1(1,) 

by  =  Xijy 1  +  ft- 

«)  ■ 

(8) 

where 

(9) 

with 


=  I  +  £  ~  — o 

dr  df  +  {hi  —  cos  &ijh) 


(10) 


where  f/„*  is  the  bond  angle  between  bonds  ij  and  ik.  The  parameters  used  in  the 
TersofTs  potentials  for  silicon.  i,e.  A.  B.  R,  S,  X,  (i,  fi,  n,  c ,  d  and  It,  are  given  in 
table  1  [26,  27], 


3.1.2.  Strain  and  stress.  In  MD  simulations,  the  stress  strain  relation  is  usually 
determined  from  tensile  models  [28.  29],  In  each  increment  of  applied  deformation, 
a  nominal  strain  can  he  determined  by  the  nominal  length  change  normalized  by 
initial  length  for  the  entire  MD  model  in  the  loading  direction.  In  thermodynamics. 
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Table  I .  Material  parameters  in  Tersoff s  potential  for  silicon  [2ft,  27). 


.-i 

H 

k 

f , 

R 

S 

Parameter 

(cV) 

<eV> 

(A"1 

)  (A'1) 

fi  n  c  d 

H 

(A) 

[Al 

Silicon 

l  .83  x  103 

4.71  x  10; 

2.48 

1,73  i 

.1  X  I0-5  0.7')  1.0  X  10s  16.22 

—0.6 

2,7 

3.0 

the  stress  in  a  solid  is  defined  as  the  partial  derivative  of  the  internal  energy  with 
respect  to  the  strain  per  unit  volume.  In  continuum  mechanics,  the  components  of 
the  stress  tensor  for  a  hyperclast ic  material  arc  given  by 


where  V  is  the  volume  of  the  solid,  E  is  the  total  internal  energy.  are  the 
components  of  the  strain  tensor  and  S  is  entropy.  In  special  eases,  if  the  internal 
energy  is  the  strain  energy.  Hooke’s  law  is  the  result  of  equation  (II).  In  Ml), 
the  total  energy  is  the  summation  of  the  energy  of  individual  atoms,  £“.  and  is 
expressed  as 

£“  =  T  +  If  =  ^  AW)1  +  (12) 

where  7"  is  the  kinetic  energy,  If  the  potential  energy,  Af'  the  mass,  v"  the 
magnitude  of  its  velocity  and  4>"(r)  the  potential  energy  at  the  atom  location  r. 
Thus,  the  stress  components  for  a  given  atom  arc  written  as 

=  +  "3) 

where  V“  is  the  atomic  volume  or  atom  a.  v"  and  v"  are  components  of  velocity. 

arc  the  components  of  force  between  atoms  ot  and  fi  from  the  derivative  of 
the  potential  energy,  and  r“/f  arc  the  components  of  the  distance  between  atoms  «■ 
and  fi.  The  nominal  stress  is  computed  as  the  average  of  the  atomic  stresses  for 
the  volume  of  the  model.  Thus,  the  nominal  stress  components  of  each  model  arc 
expressed  by 


where  V  is  the  volume  of  the  MD  model  and  V  =  lA\ 


A.  2.  The  Material  Point  Method  (  MPM) 

3.2,1.  Theoretical  background.  In  the  material  point  method  (MPM)  [21.  22] 
a  material  continuum  is  discretized  into  a  finite  collection  of  N  material  points, 

as  shown  in  figure  2.  Each  material  point  is  assigned  a  mass  («),„/;=  I . N\ 

consistent  with  the  material  density  and  volume  of  the  point,  and  all  other  variables. 
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such  as  position,  acceleration,  velocity,  strain  ami  stress.  The  motion  equations  arc 
solved  at  the  points  for  each  time  step  of  the  analysis  using  a  background 
computational  grid.  Information  is  transferred  from  the  material  points  to  grid 
nodes  by  using  standard  finite  element  shape  functions, 

tt 

0fx)  =  (15) 

(=i 

where  n  is  the  number  of  nodes  in  the  grid  and  subscript  i  refers  to  the  nodal  values 
of  <£(x)  containing  x. 

The  Newtonian  motion  equations  for  material  points  arc 

pdlufdtl  —  pVas  +  pib ,  (16) 

where  the  mass  density  P  ~  mp^x  ^  xp^  mp  *s  niaterial  point  mass,  x;i  is 
material  point  location,  <$(x  -  *,p)  is  the  Delta  function,  u  is  the  displacement,  the 
acceleration  a  =  d2u/dr,  f h  is  the  specific  body  force  and  as  is  a  specific  stress  tensor, 
which  is  the  stress  tensor  a  divided  by  the  mass  density.  The  material  constitutive 
relation  can  be  represented  as 

a”  =  Cwf  and  s  =l[(Vu)  +  (Vu)T],  (17) 

where  is  the  fourth-order  elasticity  tensor  and  e  is  the  strain  tensor.  This  paper 
will  consider  only  linear  clastic  materials  at  the  continuum  level. 

Applying  the  interpolation  scheme  for  acceleration  into  the  weak  form  of 
Newton's  equation  gives 

(=1 


(IS) 
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where  the  mass  matrix  is  given  by 

(V 

and  the  internal  and  the  external  forces  on  the  node  /  are 

,V  JV 

f,ml  =  -  mra"{xll)VNi{xll)  and  f/  v'  =  f;  +  £ mpth(xp)N,ixP),  (20) 

/)=  I  P=  I 

where  r,  is  the  surl'aee  traction  associated  with  grid  point  i. 

The  grid  point  accelerations  obtained  from  equation  (IS)  are  then  used  to  update 
the  position,  velocity,  stress,  strain  and  temperature  of  the  material  points. 


3.2.2.  Hierarchical  mesh  refinement.  The  conventional  MPM  method  uses  a  regular 
grid  cell  mesh  with  equally  spaced  nodes.  Adaptive  mesh  refinement  in  areas  with 
stress  concentration  is  not  possible  using  such  a  mesh.  To  accommodate  multiple 
level  meshes,  a  hierarchical  mesh  refinement  technique  is  presented  herein.  For  the 
two-dimensional  (2D)  situation,  the  hierarchical  mesh  refinement  with  three  levels 
using  Tour-node  regular  cells  (l,  4,  7  10)  and  five-node  transition  cells  (2,  3,- 5,  6)  arc 
shown  in  figure  3a.  For  the  regular  cell  with  four  nodes  in  figure  3b,  the  shape 
functions  used  in  equation  (15)  arc  expressed  as  [30]: 


Af,  -*XI 
tfs=l(l  +$X1  +17). 

4 


AA=^(l  +*)(!  -*)). 
N*  =^(l  -$)(1  +  '?)■ 

4 


(21) 


4  3 

(b) 

Level  I 

4  3 

Ixvel  II 

Level  III 


I- 1  pit  re  3,  Hierarchical  mesh  refinement  m  2D  MPM. 
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where  ($.  n)  a re  natural  coordinates,  as  shown  in  figure  3b,  Shape  functions  for 
transition  cells  with  five  nodes  are  given  by: 

-^(1-IXI  -  M=^<1  +  1X1  -irf-jM. 


M  +*)('  +  0).  JV4  =  -£)(l  +17).  (22) 

M  =^(1  -*:>(>  —  *?). 


Similarly,  for  the  three-dimensional  (3D)  situation,  a  hierarchical  refinement 
mesh  with  two  levels  is  shown  in  figure  4a  using  regular  cells  with  eight  nodes  and 
transition  cells  with  nine  (cells  2.  4  and  6)  and  thirteen  nodes  (cells  I.  3  and  5).  The 
shape  functions  for  regular  cells  with  eight  nodes  in  figure  4b  are  given  by  [30); 


M  =  l  +  ft)(  1  +  »)o)(l  +  fn)i  ft  =  Sft 


(23) 


no  =  nn>,  ft -?{((/=! . 8). 


where  i(f, I  (I.  I,  l):2(-l,  1,  1);  3  (-1,-1,  l);  4(1, -I.  1);  5(1,  I, -1);  6(-l,  1,-1); 
7(-l.  -I.  -1):  8(1.  -1,  -1).  and  (£,  t),  f)  arc  natural  coordinates  as  shown  in  figure  4. 
The  shape  functions  for  transition  cells  from  nine  to  thirteen  nodes  in  figure  4h 
are  expressed  by 

M  =  -<■(  1  +  £$,)(  1  +  »pp)(  1  +  fCi)  (i  =  I , 8), 

6 

A' i  =  iV«,  N2  =  -^/Vb, 


M  =  jV3  Mi  -  ^Ml  -^Mt-  M  =  M. 

A/ 5  =  N(,  =  M,  —  -  A/«  -  -  A/|d  —  -  /V|Jt 

(24) 

A/7  =  M  —  -  Afo  —  ^  M2  —  ^  Mj-  Nk  =  /V*, 

N,  -  =  ^(l  -|2)(1  -u)(1  +  0(0,  /V|0  =  i(l  -£)<1  -  i/H  1  -C)(-ft, 

Mi  »^(l  -f2)(l  -  u)(l  -O(-C),  Mi  =  “(1+^(1  ->))(!  -  cm, 

Mj  =5(1  -*2)U  -^>(1  -  I?). 


[n  the  3D  MPM  computational  formulation  using  the  hierarchical  mesh  with 
multiple  levels,  equation  (23)  is  used  for  regular  cells  at  each  individual  level  while 
equation  (24)  is  employed  for  the  transition  cells  between  neighbouring  two  levels. 


29 


29 RO  H.  La  el  al. 


Figure  4.  3D  MPM  regular  and  transition  cells. 


3.3.  Seamless  Coupling  between  MDfMPM 

Simulation  of  dynamic  problems  using  both  MD  and  MPM  requires  integration  of 
the  equations  of  motion,  equations  (1)  and  The  format  of  these  two  equations 
is  similar,  leading  to  a  natural  coupling  of  the  two  types  of  simulations.  There  arc 
essentially  three  regions  in  coupled  M  D/M  PM  simulations.  They  arc  MD.  MPM. 
and  their  transition  regions.  The  MD  region  is  modelled  using  the  approach 
described  in  section  3.1.  In  the  MD  region,  atoms  are  initially  placed  at  diamond 
lattice  points.  The  MPM  region  is  modelled  using  the  method  described  in  section 
3.2.  In  the  MPM  region,  eight  material  points  arc  equally  positioned  in  each  cubic 
cell.  The  coupling  of  the  two  descriptions  of  the  media  is  brought  about  using  a 
transition  region  where  both  MPM  points  and  MD  atoms  arc  initially  overlapped 
and  coincide  with  the  atomic  lattice.  The  width  of  (he  transition  region  is  equal  to 
the  cut-off  distance  of  the  interaction  potential  used  in  the  MD  region.  This  provides 
a  complete  set  of  neighbours  within  the  interaction  range  for  all  atoms  in  the  MD 
region,  Atoms/points  that  belong  to  the  transition  region  not  only  interact  via  the 
interaction  potential  with  the  MD  region  but  also  are  part  of  points  in  the  MPM 
region.  The  positions  and  velocities  of  these  atoms/points  in  the  transition  region 
(both  on  the  MD  side  and  on  the  MPM  side)  must  be  consistent  with  each  other. 
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Table  2.  Material  properties  of  single  crystal  a-silicon  (31). 


Material 

Lattice 

constant 

Density 

Young's 

modulus 

Cu 

C|i  C.W 

Bulk 

Poisson's  modulus 

Fracture 

stress 

properties 

(A) 

<getirJ) 

(Ci  Pa) 

(GPa) 

(GPa)  (GPa) 

ratio 

(GPa) 

(MPa) 

Silicon 

5.43 

2.33 

130 

I65.7S 

63.94  79.57 

0.2K 

100 

62 

The  atom  velocities  in  the  transition  region  due  to  the  interaction  with  the  MD 
region  furnish  the  loading  boundary  condition  on  overlapped  points  for  the  MPM 
simulation.  To  avoid  a  density  mismatch  at  the  boundary,  the  mass  at  each  point  for 
MPM  in  the  transition  region  is  set  equal  to  the  mass  of  each  MD  atom. 

Using  the  above  method  to  deal  with  the  transition  region,  the  displacement 
as  well  as  strain  fields  arc  compatible,  while  stress  fields  possibly  are  not  compatible. 
To  ensure  the  complete  compatibility  of  both  strain  and  stress  for  seamless  coupling 
between  MD  and  MPM,  we  use  the  non-local  elasticity  theory  [15]  in  this 
investigation.  The  elastic  energy  is  expanded  in  terms  of  the  strain  r  in  the  Taylor 
series  form  as; 


E(e)  -  E(  0)  +  (3E/%,,)|n£,/  +  -  [8?£/(8e,yaeA7)]|0«i>-e*7 


(25) 


Therefore,  the  material  clastic  constants  are  required  to  satisfy  the  following 
conditions: 


Qj  =  (3£/itey)|u  (first-order  clastic  constants), 
djiii  =  [32E/(3rfJ3^7)]|0  (second-order  elastic  constants), 
Ci,Umn  =  [B*E/(B£iiB£u<te . )]|0  (third-order  clastic  constants). 


Using  the  natural  state  as  the  reference  configuration,  the  first-order  clastic 
constants  arc  2cro  and  only  the  second-  and  higher-order  elastic  constants  need  to 
be  assigned  to  the  continuum.  As  linear  elasticity  theory  is  used  in  this  investigation, 
the  second-order  elastic  constants  or  the  lattice  and  continuum  regions  require 
matching.  Material  properties  of  single  crystal  a-silicon  are  listed  in  table  2  [31]. 


4.  Numerical  implementation 

In  this  paper,  a  tension  model  is  used  in  coupled  MD/MPM  simulations.  Using 
the  seamless  coupling  algorithms  described  in  section  3,  a  coupled  MD/MPM  code 
has  been  developed  for  multiseale  simulations.  The  computational  scheme  or  the 
How  chart  used  for  the  coupled  MD  and  MPM  simulation  of  silicon  under  tension 
is  shown  in  figure  5.  To  generate  the  tension  model,  initial  position  data  must  be 
assigned  first  to  all  atoms  and  points  in  both  MD  and  MPM  regions,  which  include 
atoms/points  in  the  MD/MPM  transition  regions.  Loading  and  boundary  conditions 
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Figure  5.  Coupled  MI)  and  MPM  computational  flow  chan. 
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MPM  region 

MD  region 

MPM  region 

Figure  6,  Schematic  of  tensile  model  for  coupled  M  D/M  PM  simulations. 
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Figure  7,  fa)  MD  atoms  and  MPM  points  initially  overlapped  in  the  transition  region, 
(b)  MPM  points  moving  under  velocity,  (c)  MD  atoms  moving  same  positions  as  MPM  points 
(black:  MD  atoms;  grey:  MPM  points:  dash  line:  MPM  grid). 


arc  also  prescribed  for  the  entire  system.  Based  on  the  given  temperature,  MD 
computation  is  first  initiated  so  that  the  initial  velocities  for  all  atoms  can  be 
determined.  The  initial  velocities  will  provide  boundary  conditions  for  the  MPM 
region  through  the  transition  region*  Then*  MPM  computation  can  be  started  and 
MPM  results  in  the  transition  region  provide  boundary  conditions  for  the  MD 
region.  Thus,  MD  computation  can  be  iterated  and  MD  results  in  the  transition 
region  provide  new  boundary  conditions  in  MPM  computation  for  the  next  time 
step.  This  iteration  process  continues  until  the  completion  of  the  simulation.  For  the 
purpose  of  illustrating  the  method,  a  schematic  of  simple  tension  model,  which 
consists  of  one  MD  region,  two  transition  regions,  and  two  MPM  regions  with  three 
hierarchical  mesh  refinement  levels,  is  shown  in  figure  6. 

The  next  item  of  interest  is  coupling  details  in  the  transition  region.  For  the 
MPM  cells  in  the  M  D/M  PM  transition  regions  (shown  in  figure  6),  MPM  points  arc 
arranged  following  MD  atoms  at  the  atomic  lattice  sites,  which  have  the  interaction 
with  the  MD  region.  In  other  words,  MD  atoms  and  MPM  points  in  the  transition 
region  are  in  one-to-one  correspondence  and  are  initially  overlapped  at  the  same 
locations,  as  shown  in  figure  7a,  It  may  be  noted  that  the  MD  time  step  increment 
is  normally  much  smaller  than  that  in  MPM,  so  that  MD  and  MPM  computations 
have  to  be  matched  well  in  coupling  simulations.  We  use  adaptive  time  step 
increments:  in  the  transition  region  wherein  the  MD  time  step  increment  and  MPM 
time  step  increment  arc  the  same.  For  continuum  region,  the  time  step  increment 
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gradually  increases  for  the  area  farther  away  from  the  atomic  region.  In  this  simple 
model,  we  chose  the  same  time  step  increment  for  both  MD  and  MPM 
computations.  For  time  integration,  the  time  step  increment  is  A/— I.OISfs 
<  1 0 “ 1 5 s>_  The  steps  involved  in  coupled  MD/MPM  simulations  are  as  follows: 

Step  I:  Initial  positions  ra  for  all  atoms  in  the  MD  region  and  all  points  in  the 
MPM  regions,  which  include  atoms/points  in  the  MD/MPM  transition  regions, 
are  given  according  to  the  dimensions  of  the  tension  model  and  the  crystal  structure 
of  the  material.  At  the  absolute  temperature  MD  computation  is  initiated  and 
initial  velocities  viU  for  all  MD  atoms  are  calculated.  The  initial  velocities  at  atoms 
are  applied  to  the  overlapped  MPM  points  in  the  transition  regions  as  a  boundary 
condition  for  MPM  regions,  as  shown  in  figure  7b. 

Step  2:  MPM  computation  is  conducted  at  the  time  step  /0  and  new  positions  r( 
and  velocities  V|  arc  obtained  for  all  MPM  points.  The  new  positions  at  points  arc 
applied  to  the  overlapped  MD  atoms  in  the  transition  regions  as  a  boundary 
condition  for  MD  simulation  as  shown  in  figure  7c. 

Step  3:  MD  computation  is  performed  at  the  time  step  fo  and  the  updated 
positions  rr,  and  velocities  \\  are  obtained  for  all  atoms.  The  updated  velocities  at 
atoms  are  applied  to  the  overlapped  MPM  points  in  the  transition  regions  as  a  new 
boundary  condition  for  MPM  simulations  at  the  next  time  step  t\. 

Step  4:  Repeat  the  same  algorithm  to  steps  2  and  3  in  turns  until  the  entire 
computation  for  the  tension  model  is  implemented  by  means  of  coupled  MD  and 
MPM  approach. 

During  coupling  of  MD  with  MPM  simulations,  MPM  always  provides  the 
updated  positions  for  MD  atoms  in  the  transition  regions,  while  MD  in  turn 
provides  the  updated  velocities  for  MPM  points  in  the  transition  regions.  It  should 
be  noted  that  following  this  algorithm  from  step  1  to  step  4,  both  atoms  for  MD  and 
points  for  MPM  in  the  transition  region  naturally  maintain  displacement  compat¬ 
ibility  so  that  strains  arc  compatible  as  well.  However,  the  stress  compatibility  cannot 
be  ensured  automatically.  Additionally,  mass  mismatch  exists  for  both  MD  atoms 
and  MPM  points  in  the  transition  region,  since  mass  determined  by  the  mass  density 
and  the  occupied  volume  is  assigned  to  each  point  in  the  MPM  region.  To  ensure 
complete  compatibility  of  both  strain  and  stress  for  seamless  coupling  between  MD 
and  MPM,  non-local  elasticity  theory  is  used  in  this  study.  Based  on  this  theory,  for 
homogenous,  linear  elastic  materials,  only  the  second-order  elastic  constants  require 
matching  between  atomistic  and  continuum  regions.  In  this  investigation,  wc  take 
clastic  constants  for  continuum  region  from  the  lattice  theory  and  use  the  same  mass 
for  both  MD  atoms  and  MPM  points  in  the  transition  regions. 


5.  Results  and  discussion 

Figure  8  is  a  3D  coupled  MD/MPM  model  at  the  initial  stage  oT  tensile 
testing  showing  the  MD  region,  MD/MPM  transition  regions,  and  MPM 
regions.  There  arc  three  levels  in  the  MPM  region.  The  dimensions  of  this  model 
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Figure  8.  Coupled  MD.  MPM  model  showing  llic  initial  siage  of  tensile  icsiitig 


are  32.6  A  x  32.6  A  x  222.8  A.  There  are  290 1  atoms  in  the  MD  region  and  9280 
material  points  in  the  MPM  regions,  which  include  atoms/points  in  the  MD/MPM 
transition  regions.  In  the  transition  regions.  MD  atoms  and  MPM  points  are  initially 
overlapped  in  one-to-one  correspondence  and  coincide  with  the  atomic  lattice.  In  the 
MD  region,  the  crystal  is  set  up  with  cube  orientation  [001]  and  the  tensile  loading 
is  applied  along  the  [001]  direction  under  controlled  velocity.  The  initial  temperature 
is  293  K  Tor  the  MD  simulations.  The  elastic  constants,  the  Young's  modulus,  the 
Poisson's  ratio,  and  other  relevant  parameters,  given  in  table  2,  arc  used  in  the  MPM 
regions.  The  coupling  simulations  on  the  tension  model  are  conducted  for  the  silicon 
work  material  at  different  strain  rates,  Lc,  44,88 ns~‘,  53,86ns”1,  62.84 ns-1. 
67*33 ns'*,  80.79 ns-1,  and  89.76ns”1.  It  may  be  noted  that  very  high  strain  rates 
are  used  of  necessity  to  minimize  the  computational  time  for  coupling  simulations. 
For  the  coupling  simulation  at  a  strain  rate  of  44.88  ns”1,  the  total  simulation  time 
was  4ps  (!(rt:s)  and  the  time  step  increment  used  was  1.018 fs.  It  took  ^3 h  of 
computational  time  using  a  PC  with  a  2.8 GHz  processor. 
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5. ! .  Deformation  ant I  failure  mechanism  under  tensile  loadini ; 

Figure  9  shows  snapshots  oT  coupling  simulation  Tor  the  entire  model  at  various 
stages  of  tensile  testing  at  a  strain  rate  or  44.88  ns-1  and  figure  10  shows  close-up 
views  ol’  figure  9.  The  snapshots  of  coupled  MD/MPM  simulation  show  that  the 
tensile  specimen  experiences  with  increasing  elongation  in  elasticity,  generation  of 
dislocations  and  plasticity  by  slip,  void  formation  and  propagation,  formation  of 
amorphous  structure,  necking,  and  final  fracture  in  the  MD  region  while  only  elastic 
deformation  is  present  in  the  MPM  regions  at  either  end.  In  the  initial  stage,  there 
is  uniform  stretching  with  clastic  deformation  and  no  dislocations  occurring  in  the 
crystal  lattice  (see  figure  1  lb).  The  limit  of  elasticity  will  be  reached  until  the  strain 
reaches  approximately  0.1  (see  figure  15).  With  further  increase  in  deformation. 


Y 

(a)  (b)  (c)  (cl)  (e) 

Figure  9.  Snapshots  of  MD/MPM  coupling  simulation  at  a  strain  rate  of  44JHn$  1  and 
at  different  lime  steps  (;0  0.0 1  ps,  (h)  I  73  ps,  (c)  3*05  ps.  (d)  3.25  ps  and  (c)  5.5  ps. 
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Figure  10.  Magnified  view  of  the  snapshots  (figure  9)  of  MD/MPM  coupling  simulation 
at  a  strain  rale  of  44.88ns-1  and  at  different  time  steps  (a)  0.01  ps,  (b)  1.73ps,  (c)  3.05 ps. 
(d)  3.25 ps  and  (e)  5.5  ps, 


the  dislocations  are  seen  to  be  generated  locally  with  their  densities  gradually 
increasing  in  the  plastic  deformation  region.  Slip  takes  place  along  [1 1 1]  at  about  45“ 
to  the  loading  direction  (see  figure  I  )c).  Plasticity  by  slip  will  last  until  the  formation 
of  voids  and  semi-amorphous  structure  (see  figure  lid),  Ultimate  tensile  stress 
is  reached  at  a  strain  of -0.3  (sec  figure  15).  Further  increase  in  elongation  results 
in  local  interactions  or  rearrangements  of  silicon  atoms,  resulting  in  somewhat 
amorphous  structure  (see  figures  1  le  and  1 10-  The  deformation  is  then  concentrated 
in  a  narrow  region  resulting  in  necking  (see  figure  lOd).  Finally,  separation  of  the 
tensile  specimen  occurs  (see  figure  10c). 

Figures  10a -e  arc  3D  snapshots  of  MD  simulation  at  higher  magnification  (of 
figure  9)  showing  different  stages  of  tensile  testing  at  a  strain  rale  of  44.88  ns  1  and  at 
different  time  steps.  Slip  taking  place  at  45°  to  the  loading  direction  can  be  clearly 
seen  (figures  1 0b  and  10c).  The  formation  of  amorphous  structure  in  the  central 
region  and  void  formation  ean  be  clearly  seen  in  figure  lOd.  Figure  1 0e  shows 
concentration  of  amorphous  structure  in  the  central  region  followed  by  necking  and 
separation.  This  corresponds  to  the  steepest  slope  in  figure  15  leading  to  final  rupture 
of  the  sample. 

Figures  11  14  show  snapshots  in  the  mid-section  at  different  stages  of  tensile 
testing  at  different  strain  rates  from  44.88 ns-1  to  89.76ns-1  and  at  different  time 
steps.  Initially  the  deformation  is  predominantly  elastic.  This  corresponds  to  steep 
clastic  slope  in  figure  15.  This  is  followed  by  slip  on  the  preferred  |l  1 1]  direction. 
Further  increase  in  strain  leads  to  the  formation  of  inhomogeneous  structure  leading 
to  amorphization  in  the  entire  necked  region  (sec  atoms  marked  by  circles  in 
figure  lid).  The  similar  umorphization  ean  be  seen  in  figures  1 2d,  13d  and  I4d. 
Ultimately,  separation  of  the  sample  takes  place  with  drastic  reduction  in  stress. 
With  increase  in  strain  rate,  the  crystallinity  of  the  silicon  is  maintained  up  to  larger 
strains.  Also,  with  increase  in  strain  rate,  the  strain  at  fracture  increases.  It  may  be 
noted  that  necking  that  occurs  before  final  failure  for  silicon  is  significantly  less  as 
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(d)  <e>  (f) 

h'ignrc  1 1.  Snapshots  of  M  D/M  PM  coupling  simulation  at  a  strain  rate  of  44.«Sins-1  at 
different  time  steps  (a)  0.01  ps,  (b)  1.52ps.  (c)  2.95  ps.  (d)  3.05  ps.  (e)  3.l5ps  and  (f)  3.56 ps. 


compared  lo  necking  in  ductile  materials,  such  as  copper,  aluminium  [24],  The 
catastrophic  failure  leads  to  a  sudden  drop  in  the  stress  strain  curves  (sec  figure  15). 


5.2.  Effects  of  strain  rale  on  material  properties 

The  effects  of  strain  rate  on  material  properties  have  also  been  investigated  for  the 
coupled  M D/M  PM  simulations.  Under  different  strain  rates,  slip  essentially  lakes 
place  at  about  45  with  respect  to  the  loading  direction  followed  by  the  formation 
of  amorphous  struct  tire  and  generation  of  voids  in  the  necked  region  and  their 
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Figure  12.  Snapshots  of  M  D/M  PM  coupling  simulation  at  a  strain  rate  of  67,33  ns  1  at 
different  time  steps  (a)  0.0 1  ps,  (h)  l.24ps,  fc)  2.58  ps.  (d)  2.87  ps,  {e)  2.95  ps  and  (f)  3.00 ps. 


coalescence,  leading  to  final  failure.  The  engineering  stress  strain  curves  at  different 
strain  rates  are  shown  in  figure  15.  They  show  that  material  behaves  differently 
at  different  stages  including  clastic  and  plastic  deformation,  and  final  rupture  at 
different  strain  rates.  Thus,  the  strain  rate  affects  material  properties  significantly. 
However,  before  the  strain  readies  Mil,  there  is  no  dislocation  generation  and  the 
initial  part  of  engineering  stress-strain  curve  is  essentially  elastic  at  different  strain 
rates  in  the  range  from  44.88  ns  5  to  89.76  ns^1.  From  the  slope  of  the  stress  strain 
curves,  the  Young's  modulus  is  determined  as  M2!  GPa,  which  is  very  dose  to  that 
obtained  from  the  lattice  theory  [31].  As  expected,  the  elastic  modulus  is  nearly 
constant  and  is  not  affected  by  the  strain  rate. 
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Figure  13.  Snapshots  of  M  D/M  PM  coupling  simulation  at  a  strain  rale  of  80*79  ns"1 
different  lime  steps  fa)  0,01  ps.  (b)  1  26psT  (c)  2,4Kps.  (d)  2*79  ps,  (e)  2,85  ps  and  (f)  2.9ips 
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The  curve  in  figure  16a  shows  the  effect  of  the  strain  rate  on  the  tensile  strength 
for  silicon.  The  tensile  strength  increases  from  ^15  to  ^18GPa  with  increase  in 
the  strain  rate  from  44*88 ns-1  to  89*76ns_l,  Figure  16b  shows  the  variation  of  the 
ultimate  strain  with  strain  rate.  It  shows  that  the  ultimate  strain  increases  rapidly  up 
to  '-0,54  as  the  strain  rate  increases  from  44.88  ns-1  to  67.33  ns“\  and  then  increases 
gradually  up  to  -0*57  as  the  strain  rate  increases  from  67,33  ns"1  to  89.76  ns-1. 

The  ultimate  strain  vs.  the  strain  rate  curve  changes  slope  at  a  strain  rate 
of  '-67.33  ns-1.  Observation  of  the  snapshots  indicates  that  defect  initiation  and 
coalesce  nee  arc  somewhat  different  across  the  strain  rate.  At  lower  strain  rates,  he. 
below  67.33  ns-1  (c.g*  at  44.88  ns“\  as  shown  in  figure  I  Id),  voids  are  formed  in  the 
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Kteurc  14,  Snapshots  of  MD/MPM  coupling  simulation  at  a  strain  rale  of  H9.76 ns" 1  at 
different  lime  steps  (a)  0.01  ps,  fb)  1.29  ps,  (c)  2.4  ps.  (d)  2.434  ps  (e)  2.7Rps  and  (0  2.8ft  ps. 


central  region  of  the  tensile  specimen,  These  voids  coalesce  to  larger  defects  resulting 
in  necking  and  ultimate  failure.  While  at  higher  strain  rates  (67.33  ns'1  or  higher,  e.g. 
at  89.76  ns  1 ,  as  shown  in  figures  1 4d  and  1 4c),  defects  are  formed  near  the  surface  at 
about  1/4  and  3/4  of  the  MD  region  (sec  regions  marked  by  circles  in  figure  14). 
These  defects  grow  in  size  and  cover  the  entire  area,  leading  to  decreased  load  hearing 
capacities  at  these  locations,  resulting  in  some  bulging,  as  shown  in  figures  14d~e. 

Compared  with  failure  at  the  lower  strain  rates  (e.g.  figure  lid),  there  are  more 
defect  sites  at  higher  strain  rate  (figure  14c).  Once  the  defects  at  multiple  sites 
coalesce,  necking  occurs,  leading  to  a  quick  rupture  of  the  material.  The  change  in 
the  slope  of  the  ultimate  failure  strain  shown  in  figure  1 6h  is  most  likely  the  result  of 
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Figure  1 5,  Engineering  stress  strain  curves  from  M  D/M  PM  coupled  simulations  for  silicon, 

the  change  in  defect  formation  mechanism  from  relatively  low  to  high  strain  rates* 
However,  it  is  not  clear  at  this  stage  the  underlying  reason  for  a  change  in  the  defect 
formation  mechanism  across  the  strain  rates.  Further  investigation  is  needed  to 
explore  this  feature. 


6.  Conclusions 

•  A  new  computational  approach  has  been  developed  for  coupling  molecular 
dynamics  (MD}  with  material  point  method  (MPM)  for  multiscalc  simulations 
from  atomistic  to  continuum.  In  the  MPM  computation,  the  hierarchical  mesh 
refinement  technique  is  introduced  to  scale  down  from  the  continuum  level  to 
the  atomistic  level  so  that  MPM  can  be  coupled  with  MD,  A  Tersoff-type, 
three-body  potential  was  used  in  MD  computations  for  silicon.  Seamless 
coupling  has  been  achieved  through  a  transition  region  between  MD  and 
MPM  regions. 

•  Coupled  M  D/M  PM  simulations  of  uniaxial  tension  at  multiscale  were 
conducted  on  silicon  at  different  strain  rates.  Numerical  results  show  that 
the  entire  tension  process  involves  elongation  in  the  elastic  region  followed  by 
dislocation  generation  and  slip  by  plastic  deformation,  void  formation  and 
propagation,  and  finally  necking  and  formation  of  an  amorphous  structure 
resulting  in  final  fracture, 

•  From  coupling  simulations,  engineering  stress  and  strain  curves  for  silicon 
were  determined  at  different  strain  rates,  and  the  effects  of  strain  rate  on 
material  properties  were  also  investigated.  For  silicon,  both  the  tensile  strength 
and  the  ultimate  strain  increase  with  increase  in  the  strain  rate. 
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Figure  1ft.  Effects  of  strain  rale  on  material  properties  obtained  from  coupled  M  D/M  PM 
simulations  for  silicon,  (a)  Tensile  strength  vs*  strain  rate,  (b)  Ultimate  strain  vs.  strain  rate. 
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The  material  point  method  (MPM)  has  demonstrated  its  capa¬ 
bilities  in  (he  simulation  of  impact/contact/penetration  and  inter* 
facial  crack  growth  problems.  Because  of  the  use  of  material  points 
in  the  description  of  a  continuum,  consistent  with  the  particle  de¬ 
scription  (atoms)  using  molecular  dynamics  (MD),  it  is  natural  to 
couple  MPM  with  MD  for  simulation  from  atomistic  to  continuum 
levels.  However,  in  addressing  plane  stress/plane  strain  problems, 
t  he  M  PM  algorithm  and  simulation  examples  available  in  literature 
use  a  regular  grid  mesh  with  uniform  square  cells  and  enforce  veloc¬ 
ity  and  displacement  continuities  through  its  background  grid,  re¬ 
sulting  in  limitations  in  dealing  with  stress  concentration,  inclined 
dislocations  and  inclined  crack,  etc.  In  this  article,  an  irregular 
mesh  is  implemented  in  MPM  to  eliminate  the  limitations  result¬ 
ing  from  the  use  of  a  regular  mesh.  The  ray-crossing  algorithm 
is  employed  to  determine  which  cell  a  material  point  belongs  to 
after  deformation  for  interpolation  and  extrapolation  of  variables 
between  material  points  and  grid  nodes.  As  an  example  to  demon¬ 
strate  the  capability  of  the  MFIVt  using  irregular  mesh,  the  stress 
held  in  a  continuum  with  an  inclined  crack  is  determined  using 
arbitrary  quadrilateral  cells  in  the  background  grid  mesh.  The  use 
of  irregular  mesh  in  MPM  was  validated  by  comparing  MPM  re¬ 
sults  with  ABAQUS/Explicit  simulation.  The  proposed  method  of 
using  irregular  mesh  will  be  an  essential  element  in  using  MPM 
to  couple  with  atomistic  scale  simulation  so  that  MPM  can  ad* 
dress  inclined  dislocations  and  cracks  emanating  from  the  atomistic 
simulation. 


1.  INTRODUCTION 

Material  point  method  (MPM)  was  pioneered  by  Sul  sky 
et  al.  [1-4]  to  address  dynamic  problems  in  solid  mechan¬ 
ics,  It  has  demonstrated  its  capabilities  in  the  simulation  of 
i m pac L/c on ta ct/pcnc t rati o n  [3-6]  and  inter  facial  crack  growth 
problems  [7.  8].  lu  MPM.  a  material  continuum  is  discretized 
into  a  finite  collection  of  infinitesimal  material  points.  Two  de¬ 
scriptions  arc  used  in  MPM’ — one  based  on  a  collection  of  mate¬ 
rial  points  (Lagrangian)  and  the  other  based  on  a  computational 
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background  grid  (Eulcrian).  The  background  rectangular  grid  is 
used  solely  for  the  calculations.  The  material  points  are  followed 
throughout  the  deformation  of  a  solid  to  provide  a  Lag  ran  gi  an 
description  and  the  held  governing  equations  are  solved  at  back¬ 
ground  grid  nodes  so  that  MPM  is  not  subjected  to  mesh  entan¬ 
glement.  As  a  result,  MPM  takes  advantage  of  both  Eulcrian 
and  Lagrangian  descriptions  to  give  it  the  capability  of  handling 
large  deformations  in  a  more  natural  manner  so  that  mesh  lock¬ 
up  problems  present  in  FEM  is  avoided.  Because  MPM  uses 
material  points  to  represent  a  material  continuum,  and  the  mate¬ 
rial  points  are  a  description  consistent  with  the  particle  descrip¬ 
tion  in  molecular  dynamics  (MD),  coupling  between  MPM  and 
MD  is  more  naturally  compatible.  Also,  parallel  computation  is 
relatively  straightforward  because  the  use  of  a  grid  structure  in 
MPM  is  consistent  with  parallel  computing  grids.  Additionally, 
for  problems  involving  contact  [5,  61,  MPM  provides  a  no-slip 
contact  algorithm  to  avoid  the  penetration  between  two  bodies 
based  on  a  common  background  mesh. 

When  MPM  is  coupled  with  MD  for  multiscale  simulation, 
inclined  dislocations,  and  cracks  emanating  from  the  MD  simu¬ 
lation  will  need  to  be  modeled  by  MPM  at  the  continuum  level 
Conventional  MPM  usually  uses  regular  grid  mesh,  which  con¬ 
sists  of  uniform  square  cells  in  2D  case,  leading  to  two  lim¬ 
itations.  One  is  that  conventional  MPM  cannot  address  stress 
concentration  issues.  Another  one  is  that  conventional  MPM 
cannot  handle  inclined  crack  problems,  Based  on  conventional 
MPM,  only  horizontal  or  vertical  crack  can  be  generated.  How¬ 
ever,  for  most  materials,  there  exists  sliding  after  deformation 
so  that  inclined  cracks  are  generated.  Therefore,  a  new  algo¬ 
rithm  in  MPM  with  irregular  mesh  needs  to  he  developed  to 
solve  the  above  two  issues.  However,  publications  in  the  open 
literature  on  MPM  have  not  addressed  the  issue  of  using  irreg¬ 
ular  mesh  directly.  The  objective  of  this  article  is  to  implement 
irregular  mesh  and  develop  a  new  algorithm  within  the  frame¬ 
work  of  conventional  MPM  for  addressing  inclined  dislocations 
and  cracks  for  applications  such  as  multi  scale  simulation  using 
MPM/MD. 

In  this  article,  a  method  of  using  an  irregular  mesh  in  MPM 
is  developed  to  enable  the  use  of  arbitrary  quadrilateral  cells 
in  the  background  grid  mesh  for  handling  and  inclined  cracks 
and  stress  concentration.  Transformation  between  local  (natural) 
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coordinates  and  global  (irregular)  coordinates  used  in  the  irreg¬ 
ular  mesh,  similar  to  isoparametric  elements  in  finite  element 
is  employed.  MPM  needs  to  identify  the  locations  of  material 
points  with  respect  to  cells.  In  the  case  of  the  use  of  irregular 
mesh,  a  ray  crossing  technique  is  used  to  identify  whether  the 
material  points  are  within  an  irregular  mesh  or  not.  The  use  of 
irregular  mesh  is  then  implemented  in  the  MPM  algorithm  as 
developed  by  Sulsky  et  al.  [  1  -4].  Validation  was  first  made  with 
a  tension  problem  using  two  MPM  models  (with  regular  and  ir¬ 
regular  meshes)  through  comparison  with  FEM  results  obtained 
from  ABAQUS  explicit  code.  An  inclined  crack  problem  is  also 
solved  as  an  example  to  demonstrate  the  capability  of  the  arbi¬ 
trary  quadrilateral  cells  in  the  MPM  algorithm.  The  stress  and 
deformation  fields  for  the  tensile  model  with  an  inclined  crack 
are  determined  as  a  function  of  time  from  dynamic  MPM  sim¬ 
ulations.  For  mixed  mode  fracture,  fracture  parameters  such  as 
energy  release  rates  and  stress  intensity  factors  arc  calculated 
using  ihc  virtual  crack  closure  method  and  the  displacement 
extrapolation  method.  The  same  problem  is  also  solved  using 
dynamic  FEM  to  validate  the  method  of  using  irregular  mesh  in 
MPM. 

2.  MATERIAL  POINT  METHOD 

Material  point  method  (MPM)  was  originally  developed  by 
Sulsky  and  Schreycr  of  the  University  of  New  Mexico  and  Chen 
of  the  University  of  Missouri,  under  the  support  of  Sandia  Na¬ 
tional  Laboratories  [L  2]  for  addressing  a  range  of  solid  me¬ 
chanics  problems.  Figure  I  is  a  schematic  of  the  MPM  for  2D 
calculations  in  solid  mechanics.  The  solid  line  is  the  outline  of 
the  body  to  be  analyzed.  The  black  dots  are  the  material  points. 
Each  material  point  is  given  an  initial  mass  consistent  with  the 
mass  density  and  volume  of  the  material.  Material  parameters 
such  as  mass,  displacement,  velocity,  stress,  strain,  internal  en¬ 
ergy,  and  temperature  are  assigned  to  each  material  point  ac¬ 
cording  to  the  material  it  represents.  As  the  numerical  solution 
proceeds,  the  material  points  are  tracked  and  their  states  updated 
so  that  they  carry  Lhe  complete  solution.  Information  from  the 
material  points  is  transferred  to  background  computational  grid 
nodes.  The  continuum  equations  are  discretized  at  grid  nodes 


using  standard  finite  difference  or  finite  element  approach.  The 
solutions  at  grid  nodes  are  then  used  to  update  the  position  and 
velocity  of  the  material  points.  Strain  increments  computed  on 
the  grid  are  interpolated  to  the  material  points  and  used  in  con¬ 
junction  with  constitutive  equations  to  update  stress  states  for 
each  material  point. 

The  governing  equation  in  weak  form  is  [9.  10] 

/  w  TdS  +  /  pw  bdQ 
Jn7  Jn 

=  j  pw  ad  ft  -f  /  per3  ■  V'wdQ  CO 

Jn  Jn 

where  a*  =  a/p  is  the  specific  stress  tensor  a  is  the  Cauchy 
stress  tensor,  b  is  the  body  force,  a  is  the  acceleration,  and  p 
the  mass  density,  w  is  the  test  function  which  is  an  arbitrary 
spatial  function.  Differential  volume  and  surface  elements  are 
denoted  by  d£2  and  dS ,  respectively.  Displacement  boundary 
conditions  are  defined  by  u  —  g  on  dS2fJ,  and  traction  bound¬ 
ary  conditions  are  given  by  0  *  n  =  t  on  SQr,  where  n  is 
the  unit  outward  normal  vector  at  the  boundary  surface  and 
dQ  =  U  . 

A  material  continuum  is  divided  into  a  finite  collection  of  dis¬ 
crete  infinitesimal  regions  Q P(p  =  I, _ Np)  called  material 

points.  Each  material  point  is  assigned  a  mass  mp  in  Qr,  where 
m p  =  fQ  p{x)dQ  and  Q  =  UQp>  Mass  density  can  then  be 
approximated  as  a  sum  of  material  point  masses  using  a  Dirac 
delta  function  p(x,  t )  =  -  Xrit )), 

All  variables  represented  by  such  as  the  coordinates, 
displacements,  velocities,  and  accelerations  need  to  be  trans¬ 
ferred  between  grid  nodes  and  material  points  using  the  shape 
functions  [10], 


<t>(x)  =  £>/V,(x) 


(2) 


where  Nn  is  the  number  of  nodes  in  the  background  grid  and  tfi, 
refers  to  the  ;'th  nodal  value  of  cp(x). 

The  numerical  solution  will  be  obtained  at  a  discrete  set  of 
times,  r* ,  where  k  —  1 , . , , ,  tf  Based  on  Eq.  ( 1 )  and  the  interpo¬ 
lation  scheme  in  Eq.  (2),  the  weak  form  of  momentum  equations 
reduces  to 


/=> 

where  the  mass  matrix  is 

p=\ 


(4) 
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lhe  internal  force  ami  the  external  force  on  the  node  /  arc 

r  ^t**.#*)  and 

p~l 

r  =  ^+E«/.»>(x‘P.o^(x‘)  (5) 

p=i 

and  the  discrete  applied  traction  is  tJ  =  fm  Nj{x) t(x,  tk)dS> 
The  grid  nodal  acceleration*';  from  Eq.  (3)  are  then  used  lo 
update  the  position,  velocity,  stress,  strain,  and  temperature  of 
the  material  points.  In  this  investigation,  the  update -stress-last 
(USL)  algorithm  is  used  [11],  Lc.,  the  stress  on  the  material 
points  is  updated  at  the  end  of  each  lime  step,  using  the  strain 
increment  calculated  from  the  updated  material  point  velocities 
interpolated  to  the  grid, 

3,  THE  USE  OF  IRREGULAR  MESH  IN  MPM 

Figure  2  shows  an  arbitrary  quadrilateral  cell  with  initial  ma¬ 
terial  points  in  the  local  (natural)  and  global  coordinate  systems 
in  the  MPM  algorithm  with  an  irregular  mesh.  At  the  initial  time 
step*  material  points  are  uniformly  distributed  at  quarter  loca¬ 
tions  in  the  cell  in  terms  of  the  local  (natural)  coordinate  sys¬ 
tem,  Each  material  point  is  assigned  a  quarter  of  the  cell  mass* 
Furthermore,  it  is  necessary  to  determine  which  cell  in  the  back¬ 
ground  grid  mesh  a  material  point  belongs  to  after  deformation 
so  as  to  implement  the  interpolation  and  extrapolation  of  vari¬ 
ables  between  material  points  and  grid  nodes.  In  this  study,  the 
ray-crossing  algorithm  1 12]  is  employed  to  determine  whether 
the  material  points  are  inside  or  outside  of  the  arbitrary  quadrilat¬ 
eral  cells.  Figure  2  also  shows  Lhe  principle  of  the  ray-crossing 
algorithm  for  an  arbitrary  quadrilateral  cell,  To  determine  the 
status  of  a  material  point,  we  consider  a  horizontal  ray  ema¬ 
nating  from  this  material  point  to  the  right  If  the  number  of 
times  this  ray  intersects  with  the  sides  of  the  polygon  is  even 
(including  zero),  the  material  point  is  outside  of  the  polygon.  If 


PIG,  2  Aii  urhiiniry  quadrilateral  cell  with  initial  materia!  points  and 
sc  hemal  it:  of  the  ray-crossing  algorithm. 


the  number  of  intersections  is  odd,  the  material  point  lies  inside 
the  polygon.  For  example,  rays  emanating  from  sample  material 
points,  A  and  C,  considered  in  Figure  2  intersect  with  the  sides 
of  polygon  zero  and  two  times,  respectively,  and  are  thus  outside 
the  polygon*  Material  point  B  intersects  with  the  side  one  time 
and  is  thus  inside  the  polygon. 

At  each  time  step  iteration  in  MPM  compulation,  all  vari¬ 
ables,  such  as  displacement,  velocity,  acceleration,  strain,  stress, 
and  force,  need  to  be  transferred  between  grid  nodes  and  ma¬ 
terial  points  using  the  shape  functions.  In  this  transformation, 
the  local  (natural)  coordinates  for  updated  positions  of  material 
points  are  required  in  the  shape  function  used  in  Eq.  (2)  (see 
Eq,  (7)  for  further  details)  after  the  particle  locations  in  the  grid 
cells  have  been  determined.  In  a  2D  situation,  the  shape  function 
is  of  Lhe  same  form  as  that  used  in  the  finite  element  analysis 
[ )  0|  and  can  be  expressed  as 

Nt  =  -(1  +  £o)(l  +  r|0)  £41  = 

4  (6) 

no  =nn,  U  =  1.2, 3,4) 

and  the  local  coordinates  for  four  nodes  of  a  cell  f(£* ,  T|, )  are 
1(-1,  —  I);  2(1,  —  1);  3(1 ,  t>;4< — 1,  l).  The  global  coordinates 
of  the  material  points  (xpfyp)  could  be  determined  in  terms 
of  the  global  coordinates  of  cell  nodes  Up,  y()  which  material 
points  are  associated  with, 

N  N 

xl>  =  5Z  XiNi^P'  V  yp  =  H  >'r N,{Lp.  riy,)  O) 

#1=1  #1-1 

where  qp)  are  the  local  (natural)  coordinates  of  material 
points  and  N  —  4  for  four  node  cells.  From  Eqs*  (6)  and  (7),  we 
have 

*,.  =  -  4)0  -  np)jri  +  jU  +  £p)U  - 

+  ^{1  +  tp)(l  +  T|ji)X3  +  ^(1  —  Lp)(  l  +  Hp)v4 

yP  =  -  in)0  -  Up)?'  +  ^(1  +M1  - 

+  -(1  +  £>j#)n  +  +  j(l  —  £/#)(  I  +  ri/Jyq 

(-1  <  lP  5  1  and  -  I  <  rij,  <  1).  (8) 

Let 

a,  =  +  x2  +  jr 3  -  .n  0i  =  -yi  +  yi  +  yy  -  yA 

t*2  —  — *1  “  +  xy  +  £4  =  —  yi  —  yi  "F  yy  +  y4 

ay  =  X\  -  +  Xy  -  *4  $y  =  )p|  “  VJ  +  >3  -  >p4 

OL)  =  4xp  ~  (A*j  +  X2  +  xy  +  Xa)  04  =  4yp 

-Oh  +  yi  +  >'4) 
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Eq.  (8)  becomes 

OH  =  ait,,  +  a 2r)p  + 

3*  =  3i£,P  +  32r|p  +  PsMp 
From  Eqs,  (9,  1 0),  we  have 

LP  =  i«4p3  -  Pj«,  -  (P,a2  -  ajPi)'lp]/(3jai  -  «3 0 1  > 
and 

fotj(«2p,  -  33«j)/(Ps«i  -  «jPi))np 

+  [«l(«2P3  -  02«3V(03«I  -  OMPiJ'XXjV 


- «i  -  aj(a»p3  -  04«j>/(03«i  -  «3Pi)lnp 
+  [«4  -  ai(aaP)  -  P4«iV(P3al  -  ajPOl  =  0  (12) 

Solving  the  above  quadratic  Eq,  (12),  two  roots  can  be  ob¬ 
tained  The  absolute  value  of  one  of  the  roots  r|pl  and  r\p2  must 
be  less  than  "l"  and  whichever  value  lies  within  'T*  is  taken  as 
Then,  the  value  of  r|p  is  substituted  into  Eq.  (1 1)  to  get 
Thus,  the  local  (natural)  coordinates  for  the  position  of  material 
points  are  determined.  The  updated  local  coordinates  will  be 
used  in  Eqs.  (2),  (4)  and  (5)  to  implement  the  interpolation  and 
extrapolation  of  all  variables  between  grid  nodes  and  material 
points 
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(b) 


FIG,  4.  Geometry,  boundary  and  loading  conditions  for  two  tension  models 
(uncrackcd  and  cracked  plaits) 


FIG  3.  Schematic  of  Irwin's  crack  closure  method. 
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4.  FRACTURE  PARAMETERS  FOR  A  MIXED  MODE 

CRACK 

To  demonstrate  the  capability  of  the  MPM  with  an  irregular 
mesh,  an  inclined  crack  problem  in  mixed  mode  fracture  is  con- 
sidered.  The  theoretical  framework  on  the  numerical  analysis  of 
mixed  mode  fracture  is  introduced  in  this  section. 

We  next  summarize  some  results  on  using  MPM  to  model 
crack  problems,  as  well  as  some  work  on  mixed  mode  crack 
problems.  Tan  and  Naim  [7,  8]  recently  applied  MPM  for  mod¬ 
eling  fracture.  They  utilized  MPM  to  model  a  crack  in  a  plane 
problem  and  computed  dynamic  energy  release  rales.  They  also 
considered  the  condition  of  no  interaction  of  material  points 
between  two  free  surfaces  of  the  crack  by  separating  one  line 
into  two  surfaces.  Conventional  MPM  uses  a  regular  grid  mesh 
in  which  all  MPM  cells  are  uniform  square  ones  [11,  13-20]. 
This  leads  to  two  major  limitations;  (1)  It  cannot  address  stress 


lb) 


Id 

FIG.  5-  MPM  :ind  FEM  meshes  for  ihe  tension  model  without  a  crack.  la) 
Regular  MPM  mesh,  (h)  Irregular  MPM  mesh  (c)  FEM  mesh. 


(b) 


FIG.  6.  MPM  and  FEM  meshes  for  the  tension  model  with  an  inclined 
crack,  (a)  MPM  mesh  with  an  inclined  crack.  (Jb)  FEM  mesh  with  an  inclined 
crack. 

concentration  problems  as  the  adaptive  rnesh  cannot  be  used 
in  areas  witt^Jiigh  stress  gradients;  and  (2)  It  cannot  address 
inclined  crack  problems  because  the  use  of  regular  (uniform 
square)  cells  allows  cracks  only  parallel  and  perpendicular  to 
the  grid.  However*  in  reality*  cracks  can  exist  in  any  arbitrary 
orientation.  While  a  single  crack  in  a  structure  can  be  aligned 
with  the  coordinate  axes,  un- aligned  multiple  cracks  and  crack 
kinking  cannot  be  all  brought  to  align  with  the  abscissa  or  the 
ordinate.  To  accommodate  inclined  cracks  and  to  use  a  fine  mesh 
near  the  crack  and  coarse  mesh  in  the  far  held  (for  minimiza¬ 
tion  of  computational  time  while  maintaining  accuracy)  an  ir¬ 
regular  mesh  is  needed  in  MPM,  which  is  the  subject  of  this 
article. 

Mixed  mode  fracture  problems  are  commonly  encountered 
in  engineering  [21-23],  Consequently,  analysis  of  mixed  mode 

TABLE  1 

Total  time  and  time  step  increment  for  each  model 


Tot  al  simulation  Ti  me  step 

Model  time  (pj)  increment  (ns) 


MPM  with  regular  mesh 

5.1 

14 

MPM  with  irregular  mesh 

5,1 

73 

FEM 

5*1 

73 

MPM  with  crack 

5.1 

14 

FEM  with  crack 

5J 

14 
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fracture  problems  is  important  in  fracture  mechanics  fora  better 
understanding  and  control  of  fracture  leading  to  failure  of  mate¬ 
rials,  Cracks  subjected  to  mixed  mode  loading  can  be  attributed 
primarily  to  three  factors;  (l)  mixed  remote  loading,  i,e„  re¬ 
mote  normal  and  shearing  forces  acting  on  a  component  having 
a  crack  perpendicular  to  the  normal  loading  direction,  (2)  de¬ 
flected  or  inclined  crack  under  normal/uniaxial  remote  loading, 
and  (3)  mechanical  and/or  thermal  loads  combined  with  arbi¬ 
trary  restraint  conditions,  producing  a  multiaxial  loading  condi¬ 
tion  on  a  crack.  In  computational  fracture  mechanics,  there  arc 
at  least  three  methods  available  for  the  computation  of  fracture 
parameters,  energy  release  rate  and  the  stress  intensity  factor 
(SIF)  under  mixed  Mode  1  and  Mode  I)  loading  conditions  [24]. 
They  are:  (1)  Irwin's  virtual  crack  closure  method.  (2)  the  dis¬ 
placement  extrapolation  method,  and  (3)  the  direct  J-iruegral 
computation  using  the  equivalent  domain  integral  together  with 
a  mode  decomposition  scheme.  For  applications  of  fracture  me¬ 
chanics  in  engineering  structures,  Dodds  and  coworkers  [24, 
25 1  obtained  the  stress  intensity  factors  from  corresponding  J- 
integral  computations.  Bittencourt  et  al.  [26]  showed  that  for  a 
sufficiently  re  Fined  finite  element  meshes  all  three  methods  give 
essentially  the  same  results  for  linear  clastic  materials.  Usu¬ 
ally,  the  virtual  crack  closure  method  can  be  used  to  calculate 
Mode  1  and  Mode  II  energy  release  rates  and  the  total  energy 
release  rate  for  mixed  mode  fracture  using  nodal  forces  and  rel¬ 
ative  displacements  at  the  crack  tip  Among  the  three  methods, 
the  displacement  extrapolation  method  is  the  most  convenient 
method  to  calculate  Mode  [  and  Mode  M  stress  intensity  factors 


in  terms  of  relative  displacements  and  locations  along  free  crack 
surfaces. 

The  energy  release  rate  for  the  crack  propagation  can  be  cal¬ 
culated  using  the  Irwin's  virtual  crack  closure  method  [27,  28], 
The  principle  behind  Irwin's  virtual  crack  closure  technique 
is  that  if  a  crack  is  assumed  to  propagate  an  infinitesimal  in¬ 
crement,  the  energy  released  during  the  crack  growth  will  be 
equal  to  the  work  required  to  close  this  propagated  crack  to 
its  initial  crack  size.  Based  on  this  assumption,  the  total  en¬ 
ergy  release  rate  for  a  mixed  crack  growth  increment  Aa  is 
given  by 

1  fAa 

G  =  Jim  — - —  I  &uda 
2A Q  J 0 

I  I  fAa 

=  lim  - — {  I  avvA\da  +  /  a,xAuda 

A^o2Aal/0  Jo 

-t -j  cry;Au?daj  (13) 


and  for  the  2D  case.  Mode  I  and  Mode  fl  energy  release  rates 
for  a  mixed  crack  fracture  will  be 
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FIG  7.  von  Miscs  stress  contours  Tor  M  PM  ( rcgular/irregutor  meshes)  and  FEM  at  t  =  5  (a)  MPM  model  wilh  regular  mesh,  tbl  model  wiih  irregular  mesh, 

(cl  PPM  model. 
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In  numerical  methods  .the  integral  can  he  approximated  using 
the  nodal  forces  and  nodal  displacements  al  the  crack  tip.  In 
Figure  3,  the  work  required  to  close  the  propagated  crack  to  its 
original  crack  size  is  given  by 

AW  -  -  ux)  +  Fyfrf  -yK)  +  Fi(wf  -  wx)]  (15) 

where  u/*tix,yfMyx,  Wf  and  wx  are  displacement  components 
and  the  subscripts  of  T  and  ‘g*  denote  locations  belonging  to  the 
top  and  bottom  surfaces  at  the  crack  tip,  respectively,  as  shown 
in  Figure  3, 

Based  on  the  definition  of  the  energy  release  rate  in  Eq.  ( 1 3),  the 
total  energy  release  rate  can  be  expressed  as 


G  = 


AW 


(16) 


where  A  A  is  the  crack  area  increment  due  to  crack  extension 
increment,  A  a.  Thus,  the  Irwin's  virtual  crack  closure  technique 
can  be  implemented  in  both  dynamic  FEM  and  MPM  simula¬ 
tions,  Generally,  in  the  elastic  regime  there  is  a  direct  relationship 
between  the  stress  intensity  factor  and  the  energy  release  rate. 
However,  in  the  mixed  mode  situation  this  relation  cannot  be 
used  easily.  Therefore,  the  displacement  extrapolation  method 
is  used  to  determine  Mode  1  and  Mode  II  stress  intensity  factors 
in  this  study.  Elastic  solutions  for  the  displacements  at  and  near 
the  crack  tip  are  used  in  this  method.  Paris  and  Sih  [29]  gave  the 
displacements  for  linear  elastic  materials  as 


k,  FT r  M  e  301 

11  =  4G  V  2n  L  *  _  C0S  2  -  C0S  "2"  J 
Kn  rr  r  ,  .  e  .301 

-4cM(2x  +  3),m2-*,nT| 


+  OIO  (17) 


Time  (/i.v ) 


FIG.  Kn 


Comparison  of  the  .t- displacement  and  von  Mbcs  stress  at  location  A  {4.0  mm.  5.45  mini. 
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where  u,  *vt  and  tc  are  ihc  local  Cartesian  displacements,  (r,  0) 
are  the  local  polar  coordinates  with  its  origin  at  the  crack 
tip,  G  ts  the  shear  modulus,  x  =  3  -  4~v  for  plain  strain  or 
axi-symmeiric  conditions,  x  =  (3  -  *v)/{1  +  *v)  for  plane  stress 
conditions,  *v  is  the  Poisson's  ratio,  and  O(r)  represents  terms 
of  order  r  or  higher 

If  we  neglect  the  higher  order  terms  as  r  -+  0,  and  evaluate 
Eqs,  f  17— 19}  at  0  =  ±180°,  the  displacements  along  the  free 


crack  faces  arc  given  as 
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Equations  (20-22)  yield  to 
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FIG.  8b.  Comparison  of  Ehe  x- displacement  and  von  Mises  stress  al  location  B  (H.O  mm.  7,96  mm). 
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Km  = 


G  1 A  tu| 

I+X 


(25) 


where  Au,  Ay.  and  Auj  are  the  relative  displacements  of  one 
crack  face  with  respect  to  the  other  ^Jr,  and  can  be 
calculated  based  on  the  nodal  displacements  and  locations  along 
free  surfaces  of  the  crack  determined  from  the  MPM  computa¬ 
tion,  and  subsequently  Kf ,  Kn ,  and  Km  are  obtained  using 
Eq.  (25). 


5,  MPM  AND  FEM  MODELING 

Dynamic  MPM  and  FEM  analyses  are  conducted  on  two 
kinds  of  tension  models  (uncracked  and  cracked  plates).  (I) 
First,  to  validate  the  method  of  using  an  irregular  mesh  in  MPM, 
a  tension  model  without  a  crack  is  simulated  using  both  regu¬ 
lar  and  irregular  MPM  meshes,  as  well  as  an  FEM  model  by 
ABAQUS/Explidt  code.  The  geometry  and  boundary  condi¬ 


tions  of  the  tension  model  are  shown  in  Figure  4a.  A  tensile 
step  load  of  400  N  is  applied  at  the  right  end  and  the  left  end 
is  fixed.  For  the  detailed  comparison  of  MPM  and  FEM  results, 
three  specific  locations  A,  B,  and  C  are  chosen  in  the  sample 
to  validate  the  MPM  algorithm,  shown  in  Figure  4a,  Point  A 
is  chosen  far  away  from  points  B  and  C  along  longitudinal  di¬ 
rection  to  demonstrate  the  stress  wave  propagation.  Points  B 
and  C  arc  located  close  together  in  the  x-di  reel  ion  but  are  sepa¬ 
rated  in  the  y-direction  to  verify  the  uniformity  of  the  von  Mises 
stress  distribution  along  the  cross-section  (2)  To  demonstrate 
the  capability  of  the  MPM  with  irregular  mesh,  another  ten¬ 
sion  model  with  an  inclined  crack,  shown  in  Figure  4b,  is  simu¬ 
lated  using  both  the  MPM  algorithm  and  FEM  with  an  adaptive 
mesh.  Both  ends  of  the  plate  are  subjected  to  tensile  loads  of 
2, 000  N.  An  inclined  crack  of  2  mm  length  oriented  at  an  an¬ 
gle  of  55°  is  located  at  the  center  of  the  plate.  For  the  above 
two  uncrackcd  and  cracked  tension  models,  a  linear  elastic, 
isotropic,  and  homogeneous  material  is  considered  in  this  study. 


FIG.  8c 


Comparison  of  the  .('displacement  and  von  Mises  stress  nt  location  C  (7.71  mm.  5.49  mm). 
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The  linear  clastic  properties  of  the  material  are  taken  as:  mass 
density  p  —  1  gm/cm3,  Young's  modulus  E  —  10000  MPa* 
and  Poisson's  ration  =  0.001  [16]*  in  both  MPM  and  FEM 
models,  free  surfaces  of  the  crack  arc  introduced  using  an  ap¬ 
proach  that  allows  the  placement  of  two  sets  of  nodes  at  the  same 
locations  along  the  crack  line  to  avoid  the  interaction  between 
neighboring  cells/elcments  in  the  two  sides  of  the  crack  line, 
as  shown  in  Figures  5  and  6.  To  allow  the  direct  comparison 
between  MPM  and  FEM  results*  meshes  used  in  both  models 
are  identical  and  are  shown  in  Figures  5  and  6,  To  generate  the 
MPM  and  FEM  meshes,  I-Deas  software  is  used  for  preprocess¬ 
ing  and  ABAQUS/Explieit  is  used  for  dynamic  FEM  analysis. 
To  characterize  the  stress  singularity  near  the  crack  tip,  very  fine 
mesh  is  used  around  the  crack  and  coarse  mesh  is  used  in  the  far 
field  to  minimize  the  computation  time  and  to  raise  the  computa¬ 


tion  capacity.  This  adaptive  mesh  will  also  improve  the  accuracy 
of  fracture  parameter  calculations.  During  the  entire  computa¬ 
tion  in  MPMt  the  background  grid  mesh  never  changes,  Thus* 
material  points  can  move  in  or  out  of  the  cells  after  deformation. 

Figures  5  and  6  show  MPM  and  FEM  meshes  for  the  ten¬ 
sion  models  without  a  crack  and  with  an  inclined  crack,  respec¬ 
tively,  For  the  uncracked  plate*  there  are  600  material  points 
in  the  MPM  model  with  the  regular  mesh  (Figure  5a)  and 
1,616  material  points  in  the  MPM  model  with  irregular  mesh 
(Figure  5b).  In  the  FEM  model,  there  are  433  nodes  and  404  el¬ 
ements  (Figure  5c).  For  the  tension  plate  with  an  inclined  crack, 
the  MPM  model  with  an  irregular  mesh  has  14*628  material 
points  (Figure  6a)  and  FEM  model  has  3*767  nodes  and  3.697 
elements  (Figure  6b).  For  explicit  dynamic  MPM/FEM  anal¬ 
ysis*  the  stable  time  step  increment  depends  on  the  length  of 


o  0.2  0.4  0,6  o.a  i 

Distance  from  crack  tip  (/run  ) 

(a) 


HG.  9.  von  Mises  stress  distributions  along  inclined  crack  line  near  ihe  crack  tip,  (a)  von  Mises  stress  distributions  along  inclined  crack  hnc  near  crack  tip  at 
different  times  using  irregular  mesh  MPM  analysis  (b)  Comparison  of  von  Mises  stress  distribution  along  crack  hue  near  crack  tip  at  t  -  4  /is. 
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MG.  10,  (;i)  Mode  I  Variation  of  energy  release  rate  with  time,  th)  Mode  II:  Variation  of  energy  release  rate  with  time  {cj  Variation  or  the  total  energy  release 

rate  versus  lime. 
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482 

the  cell/etemenl  and  ihc  longitudinal  bur  wave  velocity,  and  the 
minimum  lime  step  increment  was  used  lo  carry  out  the  numer¬ 
ical  simulations.  Table  1  lists  details  of  the  total  simulation  time 
and  the  lime  step  increment  for  each  model.  For  uncrackcd  ten¬ 
sion  model  a  uniform  force  was  applied  at  one  end  of  the  plate 
and  the  other  end  is  constrained  only  along  the  x -direct ion.  For 
cracked  model,  uniform  forces  are  applied  at  both  ends  of  the 
plate. 

6.  RESULTS  AND  DISCUSSION 

Explicit  dynamic  analyses  were  conducted  on  the  unc racked 
tensile  specimen  using  two  MPM  models  (regular  and  irregular 
meshes)  and  one  FEM  model  After  running  dynamic  MPM  and 
FEM  simulations*  displacement  and  stress  fields  with  the  time 


were  obtained  for  these  three  tension  models.  As  an  equivalent 
scalar  stress  measure*  von  Miscs  stress  was  selected  to  represent 
the  stress  field  ahead  of  the  crack  lip,  since  a  mixed  mode  crack 
was  introduced  in  the  model  Figure  7  shows  von  Miscs  stress 
contours  at  t  =  5  pj  for  the  two  MPM  models  (regular  and 
irregular  meshes)  and  the  FEM  model.  The  results  show  that 
Ihc  overall  profiles  of  three  dynamic  stress  contours  are  fairly 
consistent.  Figure  8  gives  a  comparison  of  the  curves  for  the 
displacements  along  the  ^-direction  and  von  Miscs  stresses  as  a 
function  of  time  at  specific  locations  A,  B,  and  C  respectively* 
in  the  specimen  as  indicated  in  Figure  4a.  Simulation  results 
in  Figure  8  also  show  that  the  stress  wave  travels  from  right  to 
left  with  a  longitudinal  bar  wave  velocity  c  =  3*162  m/s  for  all 
three  models.  After  the  stress  wave  arrives  at  these  three  specific 
locations*  material  points  there  start  moving  and  the  von  Mises 
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FIG  1 1  tal  Mode  I;  Variation  of  stress  intensity  factor  with  time,  (bl  Mode  II:  Variation  of  stress  intensity  factor  with  time. 
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stress  rapidly  increases  and  reaches  very  dose  to  the  theoretical 
equilibrium  value  cr=40  MPa.  It  can  be  seen  that  all  results  for 
the  three  models  in  Figure  7  and  Figure  8  arc  in  good  agreement 
indicating  the  validity  of  the  MPM  with  an  irregular  grid  mesh 
for  the  uncraeked  specimen. 

For  the  tension  model  with  an  inclined  crack  in  MPM  anal¬ 
ysis.  ihe  von  Mises  stress  distributions  along  the  inclined  crack 
line  near  ihe  crack  tip  at  different  times  are  shown  in  Figure  9a.  In 
this  figure,  a  high  stress  gradient  exists  due  to  a  stress  singularity 
near  the  crack  tip  as  expected.  Figure  9b  depicts  a  comparison 
of  irregular  mesh  MPM  and  FEM  results  at  t  4  [is,  indicating 
ihai  the  results  of  both  MPM  and  FEM  are  in  very  good  agree¬ 
ment  for  this  example.  A  comparison  with  FEM  shows  that  the 
MPM  with  an  irregular  mesh  is  able  to  accommodate  the  in¬ 
clined  crack  issues  accurately  and  implement  explicit  fracture 
mechanics  computations.  An  adaptive  mesh  is  used  for  simu¬ 
lating  the  crack  in  the  MPM  model  to  improve  the  accuracy 
of  fracture  parameter  calculations.  Using  the  virtual  crack  clo¬ 
sure  method,  the  dynamic  energy  release  rates  in  Mode  1  and 
Mode  II  are  calculated  and  shown  in  Figure  10.  It  can  be  seen 
from  Figure  10  that  MPM  results  are  in  good  agreement  with 
dynamic  explicit  FEM  results.  These  MPM  and  FEM  results 
indicate  that  the  stress  waves  propagate  from  both  ends  to  the 
center.  After  the  stress  wave  arrives  near  the  crack,  the  crack 
gradually  starts  opening  and  the  energy  release  rates  increase 
with  time.  It  also  shows  that  contributions  from  both  Mode  l 
and  Mode  IE  energy  release  rates  are  significant  for  the  inclined 
crack  problem,  as  one  might  expect.  Additionally,  the  displace¬ 
ment  extrapolation  method  is  used  to  determine  the  Mode  l  and 
Mode  II  stress  intensity  factors  for  MPM  and  FEM  models. 
Comparisons  of  the  stress  intensity  factors  as  a  function  of  lime 
are  shown  in  Figures  1  la  and  I  lb,  indicating  that  irregular  mesh 
MPM  and  FEM  results  agree  very  well.  Based  on  the  good  agree¬ 
ment  between  MPM  and  FEM  models  with  an  inclined  crack, 
it  is  evident  that  the  proposed  MPM  algorithm  with  an  irregular 
grid  mesh  can  be  used  to  simulate  mixed  mode  crack  prob¬ 
lems  and  for  implementing  explicit  dynamic  fracture  mechanics 
computations. 


7,  CONCLUSIONS 

A  methodology  for  implementing  an  irregular  mesh  in  MPM 
has  been  developed  to  enable  the  use  of  arbitrary  quadrilateral 
cells  in  MPM  to  model  the  inclined  dislocations  and  cracks  for 
use  of  muliiscalc  simulation  using  MPM/MD.  The  use  of  an  ar¬ 
bitrary  quadrilateral  MPM  cells  also  makes  it  convenienl  for  cell 
refinement  in  areas  with  high  stress  gradients.  Dynamic  MPM 
simulations  of  an  uncraeked  tension  model  indicate  that  results 
from  both  regular  and  irregular  meshes  are  consistent  and  agree 
well  with  the  FEM  results  using  ABAQUS/Explicil  code,  pro¬ 
viding  validation  for  the  MPM  with  the  use  of  irregular  mesh, 
For  mixed  mode  fracture  analysis,  a  tension  model  with  an  in¬ 
clined  crack  was  also  si  initiated  using  the  MPM  algorithm  with 
an  adaptive  mesh.  The  adaptive  mesh  provides  accurate  frac¬ 


ture  parameter  calculations.  From  dynamic  MPM  simulations, 
the  Mode  I  and  Mode  U  energy  release  rates  were  calculated 
and  found  to  be  in  good  agreement  with  those  obtained  from 
dynamic  FEM  analysis.  Mode  I  and  Mode  II  stress  intensity 
factors  were  also  calculated  from  dynamic  MPM  simulations. 
Compared  with  FEM.  MPM  takes  advantage  of  both  Eulerian 
and  Lagrangian  descriptions  to  give  it  the  capability  of  handling 
large  deformations  in  a  more  natural  manner  so  that  locking 
problems  present  in  FEM  [9,  30]  is  avoided.  Additionally,  be¬ 
cause  MPM  uses  discrete  material  points  to  represent  a  material 
continuum,  and  because  material  points  are  a  description  consis¬ 
tent  with  the  panicle  description  (atoms)  in  molecular  dynamics 
(MD),  coupling  between  MPM  and  MD  is  compatible. 
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Multiscale  Simulations  Using  Generalized  Interpolation  Material  Point  (GIMP) 
Method  And  SAMRAI  Parallel  Processing 


J.  Ma1,  H.  Lul,  B.  Wang',  S.  Roy 1 ,  R.  Horaung2 3,  A.  Wissink2  and  R.  Komanduri1' 


Abstract:  In  the  simulation  of  a  wide  range  of  mechan- 
ics  problems  including  impact/contact/penetration  and 
fracture,  the  material  point  method  (MPM),  Sulsky,  Zhou 
and  Shreyer  (1995),  demonstrated  its  computational  ca¬ 
pabilities.  To  resolve  alternating  stress  sign  and  insta¬ 
bility  problems  associated  with  conventional  MPM,  Bar- 
denhagen  and  Kober  (2004)  introduced  recently  the  gen¬ 
eralized  interpolation  material  point  (GIMP)  method  and 
implemented  for  one-dimensional  simulations.  In  this 
paper  we  have  extended  GIMP  to  2D  and  applied  to  sim¬ 
ulate  simple  tension  and  indentation  problems.  For  sim¬ 
ulations  spanning  multiple  length  scales,  based  on  the 
continuum  mechanics  approach,  we  present  a  parallel 
GIMP  computational  method  using  the  Structured  Adap¬ 
tive  Mesh  Refinement  Application  Infrastructure  (SAM¬ 
RAI).  SAMRAI  is  used  for  multi-processor  distributed 
memory  computations,  as  a  platform  for  domain  decom¬ 
position,  and  for  multi-level  refinement  of  the  computa¬ 
tional  domain.  Nested  computational  grid  levels  (with 
successive  spatial  and  temporal  refinements)  are  used  in 
GIMP  simulations  to  improve  the  computational  accu¬ 
racy  and  to  reduce  the  overall  computational  time.  The 
domain  of  each  grid  level  is  divided  into  multiple  rect¬ 
angular  patches  for  parallel  processing.  This  domain  de¬ 
composition  embedded  in  SAMRAI  is  very  flexible  when 
applied  to  GIMP.  As  an  example  to  validate  the  parallel 
GIMP  computing  scheme  under  SAMRAI  parallel  com¬ 
puting  environment*  numerical  simulations  with  multi¬ 
ple  length  scales  from  nanometer  to  millimeter  were  con¬ 
ducted  on  a  2D  nanoindentation  problem.  A  contact  a]- 
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gorithm  in  GIMP  has  also  been  developed  for  the  treat¬ 
ment  of  contact  pair  between  a  rigid  indenter  and  a  de¬ 
formable  workpiece,  GIMP  results  are  compared  with 
finite  element  results  on  indentation  for  validation.  A 
GIMP  nano  indentation  problem  with  five  levels  of  refine¬ 
ment  was  modeled  using  multi-processors  lo  demonstrate 
the  potential  capability  of  the  parallel  GIMP  computa¬ 
tion, 

keyword:  Material  point  method  (MPM)*  Generalized 
interpolation  material  point  method  (GIMP),  Tension, 
Nanoindentation,  Parallel  computing*  SAMRAI,  Multi¬ 
level  refinement.  Contact  problem, 

1  Introduction 

The  material  point  method  (MPM)  has  demonstrated  its 
capabilities  in  addressing  such  problems  as  impact,  up¬ 
setting,  penetration*  and  contact  (e.g*  Sulsky,  Zhou  and 
Schreyer  (1995);  Sul  sky  and  Schreyer  (1996)).  In  MPM, 
two  descriptions  are  used  -  one  based  on  a  collection  of 
material  points  (Lagrangian)  and  the  other  based  on  a 
computational  background  grid  (Eulerian),  as  proposed 
by  Sulsky*  Zhou  and  Schreyer  (1995),  A  fixed  struc¬ 
tured  mesh  is  generally  used  in  the  background  through¬ 
out  the  MPM  simulations.  The  material  points  are  fol¬ 
lowed  throughout  the  deformation  of  a  solid  to  provide 
a  Lagrangian  description  and  the  governing  field  equa¬ 
tions  are  solved  at  the  background  grid  nodes  so  that 
MPM  is  not  subject  to  mesh  entanglement.  Compared 
to  the  finite  element  method  (FEM)*  MPM  takes  ad¬ 
vantage  of  both  Eulerian  and  Lagrangian  descriptions 
and  possesses  the  capability  of  handling  large  defor¬ 
mations  in  a  more  natural  manner  so  that  mesh  lock¬ 
up  problems  present  in  FEM  are  avoided.  Addition¬ 
ally,  for  problems  involving  contact,  MPM  provides  a 
naturally  non-slip  contact  algorithm  to  avoid  the  pene¬ 
tration  between  two  bodies  based  on  a  common  back¬ 
ground  mesh  (Sulsky*  Zhou  and  Schreyer  (1995),  Sul- 


59 


1 36  Copyright  ©  2005  Tech  Science  Press 


CMBSt  voL8t  no, 2,  ppJ3S-J52,  2005 


sky  and  Schreyer  (1996)).  One  drawback  of  the  con¬ 
ventional  MPM  is  that  when  the  material  points  move 
across  the  cell  boundaries  during  deformation,  some  nu¬ 
merical  noise/errors  can  be  generated,  Bardenhagen  and 
Kober  (2004).  To  solve  the  instability  problems  asso¬ 
ciated  with  the  conventional  MPM  simulations,  Barden¬ 
hagen  and  Kober  recently  proposed  the  generalized  inter¬ 
polation  material  point  method  (GIMP)  and  implemented 
for  one-dimensional  simulations. 

The  present  investigation  extends  the  GIMP  presented 
by  Bardenhagen-Kober  to  two-dimensional  simulations 
and  applies  it  to  simple  tension  and  indentation  problems. 
Furthermore,  a  refinement  technique  and  a  parallel  pro¬ 
cessing  scheme  are  developed  so  that  the  serial  GIMP  al¬ 
gorithm  and  code  can  be  extended  for  parallel  computa¬ 
tion  of  large  scale  computations  based  on  the  continuum 
mechanics  approach. 

Parallel  processing  has  been  used  successfully  in  nu¬ 
merical  analysis  using  different  methods,  such  as  FEM 
and  boundary  element  method  (BEM),  Mackerle  (2003) 
and  molecular  dynamics  (MD),  Kalia  and  Nakano 
(1993).  The  computational  time  on  parallel  processors 
can  be  reduced  to  a  small  fraction  of  the  time  con¬ 
sumed  by  a  single  processor  at  the  same  speed.  Par¬ 
allel  processing  generally  involves  issues,  such  as  do¬ 
main  decomposition/partitioning,  load  balancing,  par¬ 
allel  solver/algorithms,  parallel  mesh  generation,  and 
multi-grid,  Mackerle  (2003).  Domain  decomposition  has 
been  widely  applied  in  parallel  processing  in  FEM,  Hsien 
(1997),  With  partitioning  of  the  overall  computational 
domain,  sequential  FEM  algorithm  usually  cannot  be 
used  directly  in  parallel  processing  without  some  modifi¬ 
cation,  primarily  due  to  the  coupling  of  a  large  number  of 
simultaneous  linear  equations.  Remeshtng  is  sometimes 
needed  in  each  sub-domain.  The  interfaces  of  neighbor¬ 
ing  sub-domains  must  be  meshed  identically  for  subse¬ 
quent  communications,  Mackerle  (2003).  These  prob¬ 
lems  are  intrinsic  to  certain  numerical  methods,  such  as 
FEM;  however,  they  can  be  totally  or  partially  avoided  if 
other  appropriate  computational  methods  are  used.  For 
example,  the  domain  decomposition  is  more  straightfor¬ 
ward  for  structured  meshes,  and  large  systems  of  coupled 
equations  can  be  avoided,  if  explicit  lime  integration  is 
used. 

Recently  a  platform  for  parallel  computation,  namely,  the 
structured  adaptive  mesh  refinement  application  infras¬ 
tructure  (SAMRAI),  Hornung  and  Kohn  (2002),  has  been 


developed  by  the  Center  for  Applied  Scientific  Com¬ 
puting  at  the  Lawrence  Livermore  National  Laboratory. 
SAMRAI  has  provided  interfaces  for  user-defined  data 
types  so  that  material  points  carrying  physical  variables 
(mass,  displacement,  velocity,  acceleration,  stress,  strain, 
etc.)  can  be  readily  defined.  As  a  result,  SAMRAI  is  very 
suitable  for  handling  material  points  and  their  physical 
variables  in  MPM  or  its  variant,  GIMP.  In  this  investi¬ 
gation  SAMRAI  is  used  for  parallelizing  GIMP,  SAM¬ 
RAI  has  also  provided  a  foundation  for  parallel  adap¬ 
tive  mesh  refinement  (AMR)  with  the  use  of  either  dy¬ 
namic  or  static  load  balancing,  Wissink,  Hysom  and  Hor¬ 
nung  (2003).  This  function  allows  SAMRAI  to  process 
both  spatial  and  temporal  refinements  in  areas  of  inter¬ 
est,  typically  with  high  gradients  in  some  physical  vari¬ 
ables  (e.g.,  strains),  and  to  use  coarse  mesh  in  the  remain¬ 
ing  areas.  With  the  appropriate  use  of  fine  and  coarse 
meshes  in  different  regions,  multiscale  simulations  us¬ 
ing  MPM  can  provide  desired  computational  accuracy 
with  reduced  costs  associated  with  computer  memory 
and  computational  time. 

Material  multiscale  simulations  span  from  elec¬ 
tronic  structure,  atomistic  scale,  crystal  scale,  to 
macro/contmuum  scale,  Horstemeyer,  Baskes,  Prantil, 
Phiiliber  avd  Vonderheide  (2003);  Komanduri,  Lu, 
Roy,  Wang  and  Raff  (2004).  Appropriate  simulation 
algorithms  can  be  used  at  various  scales,  e.g.,  ah 
initio  computation  for  electronic  structure,  molecular 
dynamics  at  the  atomistic  scale,  crystal  plasticity  or 
mesoplasticity  at  the  crystal  scale,  and  continuum  me¬ 
chanics  at  the  macro  scale,  Horstemeyer,  Baskes,  Prantil, 
Phiiliber  and  Vonderheide  (2003).  At  the  continuum 
scale,  FEM  is  generally  used.  Recently,  the  meshless 
local  Petrov  -  G  ale  rkin  (MLPG)  method  (Shen  and  Atluri 
(2004a,  2005))  and  the  eontinuum/iattice  Green  function 
method  (Tewary  and  Read  (2004))  have  been  used  to 
couple  with  molecular  dynamics  seamlessly.  The  MLPG 
method  is  a  simple,  less-costly  alternative  approach 
to  FEM,  Atluri  and  Shen  (2002).  For  the  purposes  of 
providing  the  insights  into  the  discrete  atomistic  system 
and  coupling  with  continuum,  an  equivalent  continuum 
was  defined  in  the  MD  region  to  compute  the  atomic 
stress  based  on  the  Smoothed  Particle  Hydrodynamics 
(SPH)  method,  Shen  and  Atluri  (2004b).  The  atomic 
stress  tensor  computed  using  the  SPH  method  is  more 
natural  than  other  atomic  stress  formulations  because 
it  is  in  the  nonvolume-averaged  form  and  rigorously 


60 


Mufffscafc  Simulations  Using  Generalized  interpolation  Material  Point  (GIMP)  Method  And  S  A  MR  A I  Para  f  Id  Processing]  37 


satisfies  the  conservation  of  linear  momentum.  Hence,  it 
is  applicable  to  both  homogeneous  and  inhomogeneous 
deformations.  A  tangent  stiffness  formulation  was 
developed  for  both  MLPG  and  MD  regions  and  the 
displacements  of  the  nodes  and  atoms  are  solved  in  one 
coupled  set  of  linear  equations.  The  MLPG/MD  cou¬ 
pling  has  been  demonstrated  to  be  capable  of  enforcing 
the  local  balance  equations  in  the  handshaking  region 
between  continuum  mechanics  and  molecular  dynamics* 
Shen  and  Atluri  (2005), 


p(x,r)  =  X  M„6(X"XL),  where  Np  is  the  total  nurm 
p=  l 

ber  of  material  points  and  Mp  is  the  mass  of  the  ma¬ 
terial  point.  Upon  discretization  of  Eq,  (I)  using  the 
shape  functions  A/, ( ) .  the  governing  equations  at  the 
background  grid  nodes  become  (see  Sulsky,  Zhou  and 
Schreyer  { 1 995)) 

=  +  (2) 

where  the  lumped  mass  matrix  is  given  by 


The  simulation  using  parallel  GIMP  computing  scheme 
in  this  investigation  will  focus  on  multiscales,  e.g.,  from 
nanometer  to  millimeter,  based  on  the  continuum  me¬ 
chanics  approach,  namely,  2D  GIMP.  An  example  used 
for  validating  the  simulation  at  several  length  scales  at 
the  continuum  level  is  nano  indentation.  It  involves  the 
contact  issue  between  a  rigid  indenter  and  a  deformable 
workpiece.  A  contact  algorithm,  which  allows  the  con¬ 
tact  interface  to  be  located  in  a  few  computational  do¬ 
mains,  is  introduced  in  this  study.  The  contact  pressure  is 
determined  from  solving  a  set  of  equations  from  multiple 
processors.  Parallel  GIMP  results  on  nanoindentation  are 
compared  with  FEM  results  using  the  ABAQUS/Explicit 
code.  A  nanoindentation  model  with  five  levels  will  be 
used;  this  model  allows  simulation  from  nanometer  to 
millimeter  scales, 

2  Generalized  Interpolation  Material  Point  (GIMP) 
Method 

The  governing  equations  in  both  conventional  material 
point  method  (MPM),  Sulsky,  Zhou  and  Sheryer  (1995), 
Hu  and  Chen  (2003),  Bardenhagen  (2002)  and  general¬ 
ized  interpolation  material  point  (GIMP)  method,  Bar¬ 
denhagen  and  Kober  (2004),  are  briefly  summarized  in 
this  section.  The  weak  form  of  the  momentum  conserva¬ 
tion  equation  in  the  conventional  MPM  is  given  by 

/  pw  ■  =  -  [  pss  :  V  w dQ 

Jq  Jq  f  0) 

+  /  pcs  ■  w dS  +  /  pw  ■  b  V£2 

Jdn  Jq 

where  w  is  the  test  function,  a  is  the  acceleration,  and 
s\  cl  and  b *  are  the  specific  stress,  specific  traction,  and 
specific  body  force,  respectively.  £2  is  the  current  con¬ 
figuration  and  dft  is  the  surface  with  applied  traction. 
The  material  density,  p,  can  be  approximated  as  the  sum 
of  material  point  masses  using  a  Dirac  delta  function 


n'i  =  X  MpNitfp), 
p=  i 

and  the  internal  and  external  forces  are  given  by 

(fi*  =  -  X  Mrf  ■  VN.l 

p=i 


(3) 


(4) 


(0“  =~t  +  X  Afpb;A,(x'p),  (5) 

p~\  P=\ 

where  h  is  the  thickness  of  a  boundary  layer.  At  each  time 
step,  all  variables  for  each  material  point,  such  as  mass, 
velocity,  and  force  are  extrapolated  to  the  grid  nodes  of 
the  cell  in  which  the  material  point  resides.  New  nodal 
momenta  are  computed  and  used  to  update  the  physi¬ 
cal  variables  carried  by  the  material  points.  Thus,  ma¬ 
terial  points  move  relative  to  each  other  to  represent  de¬ 
formation  in  a  solid.  A  spatially  fixed  background  grid 
is  used  throughout  the  MPM  computation.  MPM  has 
already  demonstrated  its  capabilities  in  solving  a  num¬ 
ber  of  problems  involving  impact/con  tact/penetration.  In 
case  of  large  deformation,  however,  numerical  noise, 
or  errors  have  been  observed,  especially  when  material 
points  have  just  crossed  cell  boundaries  resulting  in  insta¬ 
bility  problems  in  the  MPM  simulations  (see,  e.g.,  Sul¬ 
sky,  Zhou  and  Schreyer  (1995),  Hu  and  Chen  (2003), 
Bardenhagen  and  Kober  (2004)).  The  primary  cause  for 
the  problem  has  been  attributed  to  the  discontinuity  of 
the  gradient  of  the  shape  functions  across  the  cell  bound¬ 
aries  (see,  e.g.,  Hu  and  Chen  (2003),  Bardenhagen  and 
Kober  (2004)).  To  resolve  this  problem,  Bardenhagen 
and  Kober  (2004)  proposed  a  generalized  interpolation 
material  point  (GIMP)  method.  In  GIMP,  the  interpola¬ 
tion  between  node  i  and  material  point  p  is  given  by  the 
volume  averaged  weighting  function 


^ip  ^  y~  I 


£jnaP 


(6) 
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where  Vp  is  the  current  volume  of  the  material  point, 
X,j(x)  is  the  characteristic  function  of  the  material  point, 
and  £/(x)  is  the  node  shape  function.  The  role  of  the 
weighting  function  is  the  same  as  the  shape  function  in 
conventional  MPM.  The  modified  equation  of  momen¬ 
tum  conservation,  Bardenhagen  and  Kober  (2004),  can 
he  written  as 


X  [  ^^-5 V£/x  +  £  I  CpXp  :  5w/x 


nnn„ 


an  nP 


■?/ 


,npXp 


£inQ„ 


b-5 vr/x  +  Jc  -&vdx 
Bn 


(7) 


where  8v  is  an  admissible  velocity  field,  pp  is  the  rate  of 
change  of  the  material  point  momentum.  Eq.  (7)  can  be 
further  discretized  and  solved  at  the  grid  nodes,  Barden* 
hagen  and  Kober  (2004),  Herein,  the  weighting  function 
Sip  is  C1  continuous  under  the  spatially  fixed  background 
grid.  Consequently  the  noises  associated  with  material 
point  crossing  cell  boundaries  in  the  conventional  MPM 
can  be  minimized. 

In  this  paper,  we  have  implemented  GIMP  presented 
by  Bardenhagen-Kober  for  two-dimensional  simulations. 
We  have  also  developed  a  refinement  technique  and  a  par* 
allel  processing  scheme  to  extend  the  serial  GIMP  algo¬ 
rithm  to  code  large  scale  parallel  computing.  The  ca¬ 
pability  of  parallel  GIMP  computing  has  been  demon¬ 
strated  by  modeling  nano  indentation  problem,  A  con¬ 
tact  algorithm  has  been  developed  to  address  the  contact 
problem  between  the  rigid  indenter  and  the  deformable 
workpiece.  We  proceed  next  to  describe  the  contact  al¬ 
gorithm  developed  in  this  investigation. 


2J  Contact  Algorithm  in  GIMP 

Nanoindentation  involves  a  contact  pair  of  a  rigid  inden¬ 
ter  and  a  deformable  workpiece.  The  contact  interac¬ 
tion  between  these  two  surfaces  is  governed  by  the  New¬ 
ton’s  third  law  and  Coulomb’s  friction  law  as  well  as  the 
boundary  compatibility  condition  at  the  contact  interface, 
Oden  and  Pires  (1983),  Zhong  (1993),  While  MPM  can 
prevent  the  penetration  at  the  interface  automatically,  it 
uses  a  single  mesh  for  the  two  bodies.  At  the  contact  sur- 
face,  all  components  of  the  variables  are  interpolated  to 
the  nodes  from  both  bodies  using  Eqs,  (3)-(fi),  As  a  re¬ 
sult,  MPM  using  a  single  mesh  tends  to  induce  early  con¬ 
tact  in  approaching  and  late  separation  when  two  parts 
move  away  from  each  other.  So,  MPM  cannot  model 


the  contact  behavior  between  two  pans  correctly,  Hu  and 
Chen  (2003)  proposed  a  multi-mesh  MPM  algorithm  to 
release  the  no-slip  constraint  inherent  in  the  MPM  using 
a  single  mesh.  In  the  multi-mesh  MPM,  in  addition  to  a 
common  mesh  for  all  objects,  there  is  an  individual  mesh 
for  each  of  the  objects  under  consideration.  All  meshes 
are  identical,  i.e.  nodal  locations  are  the  same.  The  multi - 
mesh  can  be  generated  by  creating  multiple  nodal  fields 
for  each  node.  Each  nodal  field  corresponds  to  an  ob¬ 
ject.  In  multi-mesh  MPM  scheme,  the  nodal  masses  and 
forces  are  mapped  from  the  material  points  of  each  ob¬ 
ject  to  its  own  mesh.  The  nodal  values  are  transferred 
to  the  corresponding  nodes  in  the  common  mesh.  When 
the  values  at  a  node  of  the  common  background  mesh  in¬ 
volve  contributions  from  two  parts,  the  contact  between 
two  parts  occurs  so  that  this  node  is  defined  as  an  over¬ 
lapped  node.  Otherwise,  two  pans  move  independently. 
This  multi- mesh  algorithm  can  handle  sliding  and  sepa¬ 
ration  for  the  contact  pair.  However,  in  using  the  multi- 
mesh  for  contact  problem  in  GIMP,  the  interaction  at  the 
overlapped  nodes  is  still  activated  too  early  before  the 
actual  contact  of  the  material  points  occurs. 

Fig.  1  is  an  example  illustrating  early  contact  when  Part 
l  is  moving  toward  Part  2,  The  four  bottom  particles  of 
Part  1 ,  labeled  in  hollow  circles,  have  come  into  the  cells 
of  Part  2,  the  nodes  of  which  cells  are  labeled  in  three 
dashed  circles.  Physical  variables  (e.g.,  normal  force, 
and  velocity  component  normal  to  the  contact  surface) 
in  Part  1  will  be  interpolated  onto  these  three  overlapped 
nodes.  The  physical  variables  in  the  three  overlapped 
nodes  will  be  further  interpolated  into  material  points 
within  the  top  layer  of  cells  in  Part  2,  and  contribute 
to  the  stress  and  deformation  in  the  entire  Part  2,  With 
this  treatment  in  the  previous  multi -mesh  algorithm,  even 
though  Parts  1  and  2  are  not  in  physical  contact,  the  par¬ 
ticles  in  Part  2  will  contribute  to  the  physical  variables  of 
particles  in  Part  1,  leading  to  numerical  early  contact,  and 
vice  versa,  through  the  overlapped  nodes.  Similar  situa¬ 
tion  occurs  w'hen  Part  1  is  retracting  from  Part  2,  resulting 
in  late  separation  of  two  parts.  Unless  other  measures  are 
taken  to  prevent  these  physically  incorrect  early  contact 
and  late  separation  problems,  they  could  cause  large  er¬ 
rors  in  GIMP  and  must  be  corrected  in  contact  problems. 
In  this  paper,  a  new  contact  algorithm  is  developed  for 
GIMP  simulations.  Fig.  2  illustrates  the  contact  algo¬ 
rithm  for  the  contact  pair  between  a  rigid  indenter  and 
a  deformable  workpiece.  Although  circular  points  are 
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Figure  1  ;  Illustration  of  early  contact  in  multi-mesh 
indenter 


t  Workpiece 

Figure  2  :  Schematic  of  contact  algorithm  between  the  rigid  indemerand  the  deformable  workpiece 


used  in  this  schematic  diagram,  it  should  be  noted  that 
the  points  are  representations  of  areas  occupied  by  these 
points,  based  on  the  GIMP  algorithm.  A  frictionless  con¬ 
tact  is  assumed  in  this  investigation.  At  the  beginning  of 
a  time  step,  a  material  point  is  located  at  point  A,  At  the 
end  of  this  time  step,  the  material  point  moves  to  B,  if 
there  is  no  contact  interaction. 


point  pt  Ff  is  the  contact  force  on  node  i,  $  and  F?  are 
the  nodal  momentum  and  force  without  consideration  of 
the  contact,  respectively.  The  velocity  \°p  of  the  material 
point  without  the  consideration  of  contact  is  given  by 


*p°  +  Ffo 
vn~  L  „  *<1 


(9) 


To  satisfy  the  displacement  compatibility  condition,  the 
material  point  has  to  be  brought  to  the  indenter  surface 
and  kept  in  contact  with  the  indenter.  The  contact  veloc¬ 
ity  correction  \cp  can  be  determined  based  on  the  rigid 
surface  orientation  indicated  by  its  unit  outward  normal 
vector  n.  The  final  location  of  the  material  point  is  set  to 
C  by  a  contact  pressure.  Hence,  the  velocity  of  a  material 
point  p  under  contact  can  be  determined  by 


The  contact  force  Fj'  on  node  i  is  the  resultant  of  the 
contact  pressures  on  the  neighboring  particles,  and  can 
be  computed  in  terms  of  contact  pressures  using  the  ap¬ 
proach  given  by  Bardenhagen  and  Kober  (2004),  i.e„ 


i  ■ 

"p?  +  F°A/,  y  FfA/? 


(8) 


where  N  is  the  number  of  nodes  contributing  to  material 


where  Q  is  the  total  number  of  material  points  in  contact 
with  the  indenter.  If  the  contact  pressure  is  assumed 
to  be  constant  in  the  contact  area  occupied  by  material 
Q 

point  q*  we  have  F[  =  £  jT^PS,  where  Tiq  —  /  Sl{x)ds. 

an. 

Since  +  V£,  the  contact  velocity  \p  for  material 
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point  p  is  given  by 


Eq.  (II)  can  be  established  for  each  material  point  in 
contact.  At  each  material  point  there  is  an  unknown 
contact  pressure  F^,  Therefore,  the  number  of  unknown 
pressures,  F^,  is  equal  to  the  number  of  points  in  contact. 
In  parallel  computing,  points  in  contact  might  be  located 
in  different  domains  processed  by  different  processors. 
Consequently,  a  parallel  solver  is  needed  to  solve  Eq. 
(11)  in  this  investigation.  Since  contact  can  only  occur 
on  the  outer  surface  of  an  object,  Eq.  (11)  is  solved  ana¬ 
lytically  under  the  physical  contact  condition  F^  ■  n  <  0  to 
find  the  contact  pressure  at  all  material  points  in  contact 
with  the  indemer,  The  contact  pressure  is  then  extrap¬ 
olated  to  the  nodes  from  the  contact  material  points  to 
update  the  total  nodal  forces. 


2,2  Numerical  implementation 

We  consider  the  case  where  initially  there  are  four  mate¬ 
rial  points  in  a  cell  for  which  the  2D  weighting  function 
is  depicted  in  Fig,  3,  To  compute  the  weighting  function, 
we  take  X/?(x)  to  be  one  in  the  current  region  occupied 
by  the  material  point  p  and  zero  elsewhere.  In  this  figure, 
one  node  is  at  the  origin  and  the  horizontal  axes  give  ma¬ 
terial  point  positions  normalized  by  the  cell  size.  Fig.  3  is 
based  on  the  same  material  point  characteristic  function 
and  node  shape  function  as  in  Bardenhagen  and  Kober 
(2004).  It  is  noted  that  the  computation  of  the  weighting 
function  in  the  deformed  state  involves  some  practical 
difficulties  because  the  integration  boundaries  in  Eq.  (7) 
can  be  difficult  to  obtain.  To  circumvent  this  problem,  we 
assume  that  the  shape  of  the  region  occupied  by  the  four 
material  points  remains  rectangular  without  rotation,  so 
that  Eq,  (6)  can  be  evaluated  analytically.  This  assump¬ 
tion  leads  to  significant  saving  in  the  computational  time 
while  introducing  only  small  errors.  Using  this  assump¬ 
tion,  GIMP  is  extended  to  2D  simulations  and  the  results 
are  presented  in  Section  4. 


3  Parallel  Computing  Scheme  Using  GIMP  with 
SAMRAI 

3J  Structured  Adaptive  Mesh  Refinement  Applica¬ 
tion  Infrastructure  (SAMRAI) 

The  Structured  Adaptive  Mesh  Refinement  Applica¬ 
tion  Infrastructure  (SAMRAI),  a  scientific  computational 
package  for  structured  adaptive  mesh  refinement  and  par¬ 
allel  computation,  is  used  with  the  GIMP  for  parallel 
computation  of  large-scale  simulations.  SAMRAI  is  cho¬ 
sen  because  of  its  similarity  in  grid  structure  with  GIMP. 
In  GIMP,  the  computation  is  usually  independent  of  the 
background  grid  mesh  so  that  a  structured  spatial ly -fixed 
mesh  can  be  used  throughout  the  entire  simulation  pro¬ 
cess.  This  advantage  makes  GIMP  highly  suitable  for 
parallel  computation,  as  the  domain  decomposition  for 
structured  mesh  can  be  easily  performed  and  no  remesh¬ 
ing  is  required.  Thus  the  complexity  and  inefficiency  as¬ 
sociated  with  parallel  processing  can  be  avoided. 

In  SAMRAI,  the  computational  domain  is  defined  as 
a  hierarchy  of  nested  grid  levels  of  mesh  refinement, 
Berger  and  Oliger  (1984),  as  shown  in  Fig.  4.  Each 
grid  level  is  divided  into  non-overlapping,  logically* 
rectangular  patches,  each  of  which  is  a  cluster  of  compu¬ 
tational  cells.  Indices  are  used  extensively  in  SAMRAI  to 
manage  grid  levels  and  patches.  For  example,  patch  con* 
nectivity  is  managed  by  the  cell  indices.  The  organiza¬ 
tion  of  the  computational  mesh  into  a  hierarchy  of  levels 
of  patches  allows  data  communication  and  computation 
to  be  expressed  in  geometrically-mtuitive  box  calculus 
operations.  Communication  patterns  for  data  dependen¬ 
cies  among  patches  can  be  computed  in  parallel  without 
inter-processor  communications,  since  the  mesh  config¬ 
uration  is  replicated  readily  across  processor  memories. 
Inter-processor  communications,  i.e,,  data  communica¬ 
tions  between  patches  on  the  same  as  well  as  neighbor¬ 
ing  levels,  are  pre-defined  by  SAMRAI  communication 
schedules.  Problem-specific  communication  interfaces 
are  also  provided  by  SAMRAL 

SAMRAI  supports  several  data  types  defined  in  a  patch, 
such  as  cell -centered  data,  node-centered  data,  and  face- 
centered  data.  These  data  are  stored  as  arrays  to  allow 
numerical  subroutines  to  be  separated  easily  from  the  im¬ 
plementation  of  mesh  data  structures.  User-defined  data 
structures  over  a  patch,  which  can  be  accessed  through 
cell  index,  are  supported  by  SAMRAI.  These  charac¬ 
teristics  make  SAMRAI  a  very  flexible  parallel  com- 
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Figure  3  ;  Material  points  in  cells  and  the  weighting  function  in  2D  GIMP 


Figure  4  :  Illustration  of  a  hierarchy  of  three  nested  grid  levels  of  mesh  refinement 


puting  environment  for  numerous  physics  applications, 
Wissink,  Hysom,  and  Hornung  (2003). 

3.2  Spatial  and  Temporal  Refinements 

In  the  application  of  SAMRAl  to  large-scale  GIMP  sim¬ 
ulations,  the  techniques  for  refinement,  both  spatial  and 
temporal,  have  to  be  developed  to  achieve  high  accu¬ 
racy  in  areas  of  high  stress/strain  gradients  while  reduc¬ 
ing  the  overall  computational  time  by  using  coarse  mesh 
in  regions  of  low  stress/strain  gradients.  Since  a  struc¬ 
tured  mesh  is  used  in  GIMP,  the  refinement  can  be  im¬ 
plemented  by  imposing  fine  levels  of  sub-grids  at  loca¬ 
tions  of  interest,  using  the  approach  adopted  by  Berger 
and  Oliger  ( 1984)  in  SAMRAl.  The  scheme  for  the  struc¬ 
tured  grid  refinement  is  illustrated  in  Fig.  4.  The  cell  size 
ratio,  also  called  the  refinement  ratio,  of  two  neighboring 
levels  is  always  an  integer  for  convenience.  The  advan¬ 


tage  of  this  refinement  technique  is  that  nesting  relation¬ 
ships  between  different  levels  can  be  handled.  A  mate¬ 
rial  point  in  GIMP  can  be  split  into  several  small  material 
points.  Tan  and  Naim  (2004)  proposed  a  criterion  to  split 
material  points  based  on  local  deformation  gradient.  If 
the  refinement  ratio  is  two  in  each  direction,  one  coarse 
material  point  can  be  split  into  four  material  points  in  the 
next  fine  level  in  2D  case.  However,  this  splitting  tech¬ 
nique  can  become  complicated  when  conservation  of  en¬ 
ergy  and  momentum  have  to  be  enforced.  In  this  paper,  a 
more  natural  refinement  approach  is  developed  to  avoid 
direct  splitting  and  merging  processes  by  using  material 
points  of  the  same  size  and  mass  in  overlapped  region 
(called  ghost  region)  between  two  neighboring  levels. 

Fig.  5  shows  two  neighboring  coarse  and  fine  grid  lev¬ 
els  in  2D  GIMP  computations  with  a  refinement  ratio 
of  two.  The  thick  line  represents  the  physical  bound- 
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Figure  5  :  Two  neighboring  coarse  and  fine  grid  levelsin  2D  GIMP  computations 


ary  of  the  fine  level  with  four  layers  of  ghost  cells.  Ini¬ 
tially,  four  material  points  are  assigned  to  each  cell  at 
the  fine  level.  At  the  coarse  level,  the  portion  overlapped 
by  the  fine  level  is  assigned  with  1 6  material  points  per 
cell.  Hence,  these  material  points  have  the  same  size  and 
initial  positions  as  those  at  the  fine  level.  The  rest  of 
the  coarse  level  is  assigned  with  four  material  points  per 
cell  GIMP  provides  a  natural  coupling  of  the  material 
points  with  different  sizes  at  the  same  grid  level  This 
is  because  the  weighting  function  depends  on  the  char¬ 
acteristic  size  of  the  material  points  and  cell  length  and 
the  interpolation  between  the  nodes  and  material  points 
is  weighed  by  the  mass  of  the  material  point.  In  GIMP 
computation,  each  level  is  computed  independently  with 
the  physical  variables  communicated  through  the  ghost 
regions  between  neighboring  levels.  Two  data  exchange 
processes,  namely,  refinement  and  coarsening  are  used 
in  the  communication.  Refinement  process  passes  infor¬ 
mation  from  the  coarse  level  to  the  immediate  fine  level, 
while  coarsening  process  will  pass  information  from  fine 
level  to  the  next  coarse  level.  In  the  refinement  process, 
physical  variables  at  the  fine  material  points  inside  the 
thick  lines  in  Fig.  5  are  copied  directly  to  replace  the 
material  points  at  the  coarse  level  In  the  coarsening  pro¬ 
cess,  the  physical  variables  at  coarse  material  points  are 
copied  to  the  ghost  cells  of  the  immediate  fine  level 
In  the  refinement,  the  material  points  located  in  the  ghost 
cells  at  the  fine  level  are  eliminated  first,  and  the  mate¬ 
rial  points  in  the  corresponding  region  of  the  coarse  level 
(with  the  same  size  as  points  in  the  immediate  fine  level) 
are  copied  to  ghost  cells  at  the  fine  level  In  the  copy¬ 
ing  process,  if  some  (small)  material  points  in  the  coarse 


level  fall  into  the  interior  (inside  the  square  on  the  right  of 
Fig.  5)  of  the  immediate  fine  level  these  points  will  not 
be  copied  to  this  fine  level  as  points  in  the  fine  level  will 
be  able  to  carry  over  all  computations  in  the  interior  of 
the  fine  level  already.  In  coarsening,  the  material  points 
of  the  coarse  level  located  in  the  region  overlapped  by 
the  fine  level  (inside  the  square  on  the  left  of  Fig.  5)  are 
eliminated  first,  and  the  material  points  in  the  fine  level 
are  then  coaled  to  the  immediate  coarse  level  With  these 
refinement/coarsening  operations,  the  material  points  can 
move  around  freely,  including  moving  outside  the  origi¬ 
nal  level  in  large  deformations.  To  ensure  that  this  coars¬ 
ening  process  can  still  be  performed  reliably  during  de¬ 
formation,  sufficiently  wide  region  of  cells  should  be  as¬ 
signed  with  refined  material  points  at  the  coarse  level  so 
that  the  ghost  cells  of  the  fine  level  always  stay  within 
the  region  with  fine  materia!  points  on  the  coarse  level 
At  the  coarse  level,  the  interior  cells  covered  by  the  fine 
level  do  not  participate  in  the  computation  and  there  are 
no  material  points  inside  (see  Fig.  5). 

The  refinement  techniques  can  be  applied  for  multiple 
times  at  the  regions  of  interest,  such  as  the  stress  concen¬ 
tration  regions.  A  fixed  refinement  ratio  of  two  between 
two  neighboring  levels  is  very  effective  in  reducing  the 
total  number  of  computational  cells.  Fig.  5  shows  nested 
multi-level  refinement  and  its  corresponding  relation  be¬ 
tween  the  total  number  of  cells  and  the  number  of  grid 
levels.  The  cell  percentage  represents  the  ratio  of  the  total 
number  of  cells  with  multi-level  refinement  mesh  to  the 
total  number  of  cells  with  one-level  finest  mesh.  If  each 
fine  level  occupies  one  quarter  of  the  neighboring  coarser 
level  as  shown  in  Fig.  6  (a),  the  cell  percentage  as  a 
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Figure  6  :  Nested  multi-level  refinement  and  reduction 
in  the  number  of  cells  with  number  of  levels 


function  of  the  number  of  grid  levels  can  be  calculated, 
as  shown  in  Fig.  6  (b).  For  example,  when  totally  four 
levels  of  successive  refinements  are  used  the  total  num¬ 
ber  of  cells  is  about  8%  of  that  of  one  uniform  fine  mesh. 
A  reduction  in  the  number  of  computational  ceils  leads 
to  a  reduction  in  the  number  of  material  points.  Hence, 
the  total  amount  of  computational  time  can  be  reduced 
significantly.  However,  refinement  and  coarsening  com¬ 
munications  will  cost  additional  computational  time,  as 
will  be  discussed  in  Section  4. 

Another  advantage  of  the  multi-level  refinement  is  that  it 
allows  for  temporal  refinement.  Since  the  computation  at 
each  grid  level  is  conducted  independently,  different  time 
step  increments  can  be  used  for  computation  at  different 
levels.  For  example,  a  smaller  time  step  increment  can 
be  used  for  the  fine  level  to  improve  computational  accu¬ 
racy,  while  a  larger  time  step  increment  can  be  used  for 
the  coarse  level.  Since  the  refinement  ratio  is  an  integer, 
the  time  step  increment  ratio  should  also  be  an  integer 


for  convenience  in  the  computation  and  data  common i- 
cation/synchronizalion,  For  example,  in  Fig.  5,  when  the 
refinement  ratios  in  both  directions  are  fixed  at  two,  the 
time  step  increment  ratio  should  be  set  to  two  as  well.  As 
a  result,  two  time  step  computations  are  performed  at  the 
fine  level,  and  results  are  passed  over  to  the  immediate 
coarse  level  to  couple  with  the  results  at  the  coarse  level. 

33  Domain  Decomposition 

GIMP  uses  structured  mesh,  consistent  with  SAMRAl, 
so  that  domain  decomposition  is  straightforward  and  no 
remeshing,  in  general,  is  necessary.  Fig.  7  (a)  shows 
a  two-dimensional  computational  domain  decomposed 
into  two  patches  separated  by  a  horizontal  dash  line.  The 
elliptical  solid  object  with  different  boundary  conditions 
applied  at  different  regions  is  inside  this  domain/grid. 
After  discretization,  there  are  a  certain  number  of  ma¬ 
terial  points  and  part  of  the  boundary  in  a  patch,  which 
will  be  computed  individually.  It  may  be  noted  that  patch 
boundary  does  not  have  to  coincide  with  the  boundary  of 
the  material  continuum.  The  patch  boundary  is  always 
chosen  to  be  larger  than  the  region  occupied  by  the  mate¬ 
rial  continuum  so  that  there  is  extra  space  for  the  material 
to  deform.  This  will  not  cause  any  additional  computa¬ 
tional  burden  as  the  GIMP  computation  is  only  carried 
out  on  material  points  inside  the  patch.  Each  patch  can 
be  processed  by  a  single  processor  and  the  convenience 
in  creating  patches  will  provide  great  flexibility  in  paral¬ 
lel  processing. 

Communication  between  two  neighboring  patches  is  re¬ 
alized  through  information  sharing  in  the  region  over¬ 
lapped  by  the  two  patches.  The  overlapped  regions  are 
also  called  *  ghost*  regions,  as  shown  in  Fig.  7  (b).  The 
ghost  cells  are  denoted  by  dash  lines.  For  ease  of  vi¬ 
sualization,  only  the  ghost  cells  overlapped  by  the  other 
patch  are  shown  and  the  ghost  cells  along  the  other  three 
sides  of  a  patch  are  not  shown.  On  one  grid  level  patches 
can  communicate  with  each  other  by  simply  copying  data 
from  one  patch  to  another  at  the  same  computation  time 
(Fig.  7  (b)).  Using  the  material  point  information  from 
the  previous  lime  step,  and  the  physical  boundary  condi¬ 
tions,  each  patch  is  ready  to  advance  one  more  time  step. 
At  this  time,  the  material  point  information  in  the  outer¬ 
most  layer  of  the  ghost  celts  becomes  inaccurate.  For  in¬ 
stance,  one  outermost  grid  node  in  patch  one,  marked  by 
the  circle,  obtains  information  from  eight  material  points 
before  advancing  to  the  next  step.  After  advancing,  it 
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(a) 


Figure  7  :  A  computational  domain  of  two  patches  in  one  grid  level 


extrapolates  to  eight  material  points.  However,  in  patch 
two,  the  grid  node  at  the  same  location  obtains  informa¬ 
tion  from  sixteen  surrounding  material  points.  It  extrap¬ 
olates  to  these  sixteen  material  points  after  advancing. 
Typically,  after  each  step,  the  material  points  in  the  next 
inner  layer  in  the  ghost  region  become  inaccurate  as  well. 

Ghost  cells  and  material  points  are  attached  to  each  patch 
to  ensure  accuracy  of  the  interior.  Each  patch  can  be 
computed  independently  for  one  GIMP  step  since  the 
momentum  conservation  equation  is  solved  at  each  node 
and  there  are  no  coupled  equations  to  solve.  No  data  ex¬ 
change  is  necessary  during  the  GIMP  step.  Therefore, 
different  patches  can  be  assigned  to  different  processors 
for  parallel  processing.  After  one  GIMP  step,  the  data  in 
the  ghost  cells  will  be  updated. 

Copying  material  points  to  ghost  cells  involves  data  ex¬ 
change  between  processors,  which  costs  additional  time. 
The  more  the  number  layers  of  ghost  cells,  the  longer  the 
time  needed  for  communication,  but  communication  can 
be  performed  less  frequently.  A  minimum  of  two  layers 
of  ghost  cells  are  necessary  to  ensure  that  computation 
at  the  material  points  inside  a  patch  is  always  correct.  If 
three  levels  of  ghost  cells  are  chosen,  the  communica¬ 
tion  can  be  performed  after  every  two  increments  of  each 
patch. 

With  these  refinements  and  domain  decomposition 
schemes  for  GIMP,  it  is  possible  to  implement  GIMP  into 
the  SAMRA1  platform.  In  this  study,  the  refinement  ra¬ 
tio  is  chosen  as  two.  Four  layers  of  ghost  cells  are  aug¬ 
mented  to  a  patch  such  that  data  communications,  includ¬ 
ing  both  data  exchange  on  the  same  level  and  between 
neighboring  levels,  are  performed  every  two  time-steps 


for  each  fine  grid  level.  This  is  critical  because  data  ex¬ 
change  between  levels  has  to  be  performed  when  the  two 
levels  are  synchronized. 

Fig.  8  shows  the  flowchart  advancing  all  grid  levels  re¬ 
cursively  starling  from  the  coarsest  level  for  one  coarsest 
time  step.  It  may  be  noted  that  the  sequential  GIMP  al¬ 
gorithm  can  be  used  to  advance  each  patch  without  mod¬ 
ification. 

4  Numerical  Examples  and  Results 

A  Beowulf  Linux  duster  of  8  identical  PCs  were  used  in 
the  simulations.  Each  PC  has  a  Pentium  4  processor  with 
a  2,4  GHz  CPU,  512  MB  RAM  except  that  the  master 
node  has  a  memory  of  1  GB,  A  gigabit  switch  is  used  to 
connect  the  network. 

Two  examples  are  used  for  validation  of  the  2D  par¬ 
allel  GIMP  computing  under  SAMRAI  platform.  The 
first  example  is  simple  tension  of  polycrysraliine  sili¬ 
con  under  plane  strain  conditions.  The  material  is  as¬ 
sumed  to  be  homogeneous,  isotropic,  and  linear  elastic. 
The  Young’s  modulus  is  170  GPa  and  the  Poisson's  ra¬ 
tio  is  0,18,  One  end  is  constrained  along  the  X-  direc¬ 
tion  while  a  normal  traction  is  ramped  up  on  the  other 
end.  The  size  of  the  tensile  model  is  0,06  mm  x  0.04 
mm.  The  length  of  a  square  grid  cell  is  0.002  mm  and  the 
time  step  is  5x  104  ps.  For  verification,  the  same  prob¬ 
lem  was  simulated  using  both  conventional  MPM  and 
FEM  (ABAQUS/Explicit).  Fig.  9  shows  GIMP.  MPM 
and  FEM  simulation  results  of  normal  stresses  in  the  X- 
directson  at  different  increments  from  a  simple  tension 
problem.  The  simulation  using  the  conventional  MPM  in 
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Figure  H  ;  Flowchart  showing  advancement  of  grid  levels  recursively  starting  from  the  coarsest  to  the  finest  level  in 
GIMP 


Fig.  9  (a)  shows  material  separation  dose  to  the  free  end 
with  severe  numerical  instability  after  275  increments. 
Fig.  9  (b)  and  (c)  show  the  normal  stress  distribution 
in  the  tensile  direction  and  deformation  after  500  incre¬ 
ments  from  FEM  and  GIMP  It  may  be  noted  that  FEM 
results  show  a  stress  contour  plot  on  the  deformed  mesh 
while  the  GIMP  results  show  a  discrete  scattered  plot  of 
material  points.  These  two  results  are  in  good  agreement 
with  the  difference  in  the  maximum  value  being  less  than 
10%. 

In  the  second  example,  indentation  on  the  same  silicon 
material  is  simulated.  The  workpiece  is  subjected  to  a 
pressure  applied  in  the  middle  of  the  top  surface  (Fig. 
10  (a))  under  plain  strain  conditions  with  a  thickness  of 
0,001  mm  for  computing  the  mass  and  forces.  The  mag¬ 
nitude  of  the  pressure  increases  linearly  with  time  for  the 
first  1500  increments,  and  is  then  kept  constant  (see  Fig. 
IQ  (b)).  The  cell  size  is  0,001  mm  in  both  directions  and 
the  time  step  is  20  ps  for  both  FEM  and  GIMP  simu¬ 
lations.  Due  to  symmetry,  only  one  half  of  the  work- 
piece  is  modeled.  This  simulation  is  performed  with  two 
patches  in  one  uniform  grid  level.  Two  processors  are 
used  and  one  patch  is  assigned  to  each  processor.  Fig.  1 1 


shows  GIMP  and  FEM  results  of  normal  stresses  in  the 
Y-direction  at  different  increments.  The  dashed  line  in 
Fig,  1 1  (a)  is  the  boundary  between  the  two  patches.  Fig, 
l  i  (a)  and  (b)  are  plots  of  normal  stresses  in  Y-direction 
at  500  time  increments  for  GIMP  and  FEM  simulations. 
The  difference  in  stress  values  in  Fig.  1 1  (a)  and  (b)  is 
less  than  5%.  It  should  be  noted  that  the  FEM  simula¬ 
tion  aborted  at  1 348  increments  due  to  excessive  element 
distortion.  The  GIMP  simulation  did  not  encounter  this 
problem.  Fig.  1 1  (c)  shows  the  GIMP  stress  result  af¬ 
ter  2000  increments.  This  demonstrates  the  capability  of 
GIMP  in  handling  excessive  distortions. 

In  order  to  validate  the  multi-level  refinement  algorithm 
and  parallel  communication,  as  well  as  the  proposed  con¬ 
tact  algorithm,  a  simulation  of  nanoindentation  with  a 
wedge  indenter  was  conducted  under  2D  plane  strain 
conditions.  The  workpiece  is  aluminum  and  the  inden- 
tcr  is  assumed  to  be  rigid.  Fig,  1 1  shows  the  indentation 
model.  The  area  below  the  indenter  where  high  stress 
gradients  are  expected  is  refined,  as  shown  in  Fig.  12 
(a),  A  prescribed  velocity  was  applied  on  the  indenter, 
as  shown  in  Fig.  12  (b).  The  work  piece  dimensions 
are  60  jiin  x  40  jito.  It  is  fixed  in  the  Y-direction  at  the 
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Figure  9  :  Simulation  results  of  tensile  stress  contours 
for  a  simple  tension  problem 


bottom.  Only  half  of  the  model  is  simulated  because  of 
symmetry.  The  cel  I  si  zes  are  500  nm,  250  n  m  and  1 25  nm 
for  levels  1,  2  and  3,  respectively.  Each  level  is  divided 
into  four  patches  with  approximately  the  same  size-  The 
maximum  indentation  depth  in  the  simulation  is  about 
450  nm.  The  dotted  lines  in  Fig-  12  (a)  illustrate  the  four 
patches  in  level  l .  For  comparison,  an  explicit  FEM  sim¬ 
ulation  (using  A I3AQU  S/Exp  licit)  was  carried  out  under 


(a)  (b) 

Figure  10  :  Loading  conditions  for  a  simple  indentation 
problem 


the  same  conditions.  The  FEM  element  size  is  uniform 
and  is  the  same  size  as  the  finest  GIMP  background  grid 
size.  In  this  example,  the  maximum  indentation  depth 
(450  nm)  is  relatively  small  compared  to  the  finest  cel! 
size,  so  that  FEM  simulation  has  not  encountered  exces¬ 
sive  mesh  distortion. 

Fig.  13  shows  a  comparison  of  contours  of  normal 
stresses  in  the  Y-direction  at  the  maximum  depth  for  both 
FEM  and  parallel  GIMP  simulations.  The  axis  of  sym¬ 
metry  of  the  workpiece  is  located  at  XjO,Q3  mm.  For 
FEM,  the  plot  is  the  contour  of  nodal  stresses  with  de¬ 
formed  positions,  and  for  GIMP,  it  is  a  discrete  scattered 
plot  of  stress  at  deformed  material  points.  The  area  be¬ 
low  the  indenter  with  high  stress  gradients  is  refined  as 
shown  in  Fig.  12  (a)  for  parallel  GIMP  computation.  The 
borders  of  grid  levels  2  and  3  can  barely  be  seen  in  Fig. 

13  (b)  due  to  the  use  of  high  material  point  density.  Fig. 

14  is  a  close-up  view  of  shear  stresses  in  which  the  three 
grid  levels  are  shown.  Results  show  that  the  normal  and 
shear  stresses  from  both  parallel  GIMP  and  FEM  sim¬ 
ulations  agree  very  well.  The  difference  of  the  normal 
stresses  in  the  Y-direction  for  the  material  point  in  con¬ 
tact  with  the  indenter  tip  and  the  stress  of  the  FEM  node 
at  the  same  location  is  4.4%.  It  may  be  noted  that  some 
non-smoothness  in  the  GIMP  stresses  around  the  level 
boarders  can  be  seen.  This  non -smoothness  is  caused  by 
the  refinement  and  coarsening  and  the  ermr  associated 
with  this  is  negligible  for  these  simulations, 

GIMP  simulations  using  a  uniform  cell  size  of  500  nm 
and  1 25  nm  were  performed  under  the  same  conditions  as 
in  Fig.  12  to  further  verify  the  refinement/coarsening  al¬ 
gorithm.  Fig.  1 5  shows  normal  stresses  in  the  Y-direction 
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(a)  GIMP  at  500  increments 


X  (mm) 


(b)  FEM  ai  500  increments 


X  (mm) 


(c)  GIMP  at  2<XX)  increments 

Figure  1 1  :  GIMP  and  FEM  results  of  normal  stress  vari¬ 
ation  in  the  Y-dtrection  at  different  increments 


(b> 

Figure  12  ;  Schematic  of  2D  indentation  showing  (a) 
three  levels  of  refinement  and  (b)  the  indenter  velocity 
history 


and  shear  stresses  in  GIMP  simulations  using  500  nm 
uniform  cells.  In  this  case  the  material  points  in  contact 
with  the  indenter  are  1 6  times  {4  times  in  each  direction) 
larger  than  those  with  two  levels  of  refinements.  In  gen¬ 
eral  the  stress  magnitudes  agree  with  those  in  Fig.  13 
and  Fig.  14. 

Fig,  16  (a)  shows  indentation  load  versus  depth  curves 
from  FEM  and  GIMP  with  different  grid  sizes.  From 
GIMP  computations,  the  load  versus  depth  curves  with 
three  levels  of  refinement  agree  very  well  with  the  re¬ 
sults  from  a  uniform  finest  mesh.  The  load  versus  depth 
curve  from  the  FEM  simulation  with  a  uniform  cell  size 
of  125  nm  under  the  same  boundary  conditions  is  plot¬ 
ted  for  comparison.  It  can  be  seen  from  Fig.  1 6  (a)  that 
the  trend  of  the  load  versus  depth  plots  from  FEM  and 
GIMP  simulations  are  similar.  The  difference  in  inden¬ 
tation  load  at  the  end  of  loading,  which  corresponds  to 
450  nm  of  indentation  depth,  is  5.9%  between  FEM  and 
GIMP  with  3  grid  levels.  When  the  depth  is  less  than 
100  nm,  there  is  only  one  material  point  in  contact  with 
the  rigid  indenter  The  assumption  of  constant  pressure 
causes  large  differences  under  this  circumstance.  How- 
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Figure  13  :  Comparison  of  normal  stresses  in  Y-direction 
from  FEM  and  GIMP 


ever  if  the  GIMP  cell  size  is  further  refined  to  62*5  nm 
am)  the  size  of  the  material  point  is  3 1 ,25  nm,  the  differ¬ 
ence  between  GIMP  and  FEM  becomes  smaller  as  can 
be  seen  in  Fig,  16  (b). 

Other  simulations  were  conducted  for  the  same  problem 


(a)  FEM 


(b)  GIMP 


Figure  14  :  Comparison  of  shear  stresses  from  FEM  and 
GIMP 


with  three  levels  of  refinement  using  different  number  of 
processors  to  test  the  efficiency  of  parallel  computing. 
The  number  of  patches  at  each  level  is  the  same  as  the 
number  of  processors  and  the  size  of  each  patch  is  ap¬ 
proximately  the  same.  The  resultant  stress  distribution 
and  indentation  load  versus  depth  plots  are  the  same  as 
the  previous  results.  The  average  time  per  computational 
step  is  7.14  sec.  when  one  processor  is  used  and  is  re¬ 
duced  to  4.26,  3.40,  2.18  sec,,  respectively  when  two, 
three,  and  four  processors  are  used.  When  four  proces¬ 
sors  are  used,  the  CPU  time  per  step  is  only  30,5%  of 
that  of  one  processor.  This  gives  a  speed-up  by  a  factor 
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(a)  Normal  Stress 


(b)  Shear  Stress 

Figure  IS  :  Normal  and  shear  stresses  of  GIMP  simula¬ 
tions  with  a  uniform  cell  size  of  500  nm 


of  3.28,  In  the  ideal  case  without  communication  over¬ 
head,  the  speed-up  would  be  4,  The  reduction  in  speed¬ 


up  from  the  ideal  number  is  because  of  the  time  involved 
in  data  communication  between  processors.  It  has  been 
observed  that  the  refinement  and  coarsening  algorithm 
consume  most  of  the  communication  time.  Moreover, 
in  refinement  and  coarsening,  most  of  the  time  is  taken 
to  search  for  the  corresponding  material  point  in  another 
grid  level.  This  portion  of  the  computational  time  can  be 
reduced,  if  improved  searching  algorithm  or  more  opti¬ 
mized  algorithm  for  the  storage  of  material  points  can  be 
implemented. 

The  manual  refinement  for  the  indentation  problem  is  ad¬ 
equate  since  the  region  of  high  stress  gradient  is  known 
to  occur  below  the  indented  The  finest  level  covers  the 
indenter  and  part  of  the  specimen.  With  the  same  ini¬ 
tial  condition,  the  results  at  the  finest  level  is  identical  to 
the  results  in  the  same  area  if  a  uniform  fine  mesh  is  used 
for  the  entire  domain  that  requires  much  longer  computa¬ 
tional  time.  The  computational  load  of  each  processor  is 
balanced  statically  by  assigning  approximately  the  same 
number  of  material  points  to  each  processor.  Dynamic 
load  balance  is  supported  by  SAMRAI  and  can  poten¬ 
tially  improve  the  efficiency  of  the  simulation. 

To  demonstrate  the  capability  of  the  algorithm  developed 
in  this  investigation  for  multiscale  simulation,  an  indenta¬ 
tion  model  with  multiple  length  scales  is  simulated  with 
eight  processors.  The  dimensions  of  the  workpiece  are 
0,25  mm  x  0.125  mm.  Initially,  the  velocity  of  the  in¬ 
denter  increases  from  0  to  150  m/s  linearly  with  time 
and  is  then  kept  constant.  Five  successive  levels  of  re¬ 
finement  are  used  in  this  simulation.  The  smallest  ma¬ 
terial  point  represents  an  area  of  64  nm  x  64  nm,  and 
the  largest  materia!  point  covers  an  area  of  l  pm  x  l 
pm.  Each  level  is  divided  into  8  equal -sized  patches  for 
best  load  balance.  Since  the  contact  surface  can  evolve 
into  several  patches,  a  parallel  solver  is  implemented  to 
solve  Eq,  (l  I)  to  find  the  contact  pressure  based  on  the 
Portable,  Extensible  Toolkit  for  Scientific  Computation 
(PETSc),  An  aluminum  workpiece  is  chosen  with  the 
Young's  modulus  and  Poisson's  ratio  of  70  GPa  and  0.33, 
respectively.  The  maximum  indentation  depth  was  9,8 
pm  in  this  simulation  (i.e.,  153  times  the  size  of  the  finest 
material  point).  It  took  nine  hours  to  simulate  this  prob¬ 
lem  with  eight  processors.  Fig.  17  (a)  gives  the  normal 
stress  distributions  and  Fig,  17  (b)  shows  normal  stress 
distribution  for  the  finest  two  levels.  The  relative  large 
deformation  in  this  multiscale  nanoindentation  problem 
could  not  be  handled  by  FEM  due  to  excessive  distortion 
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Figure  16  :  Indentation  toad  versus  depth  curves  from 
FEM  and  GIMP  with  different  grid  sizes 
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in  the  FEM  mesh.  However,  the  parallel  GIMP  code  was 
able  to  complete  the  entire  loading/unloading  processes 
without  any  difficulty.  This  example  shows  clearly  the 
advantage  of  GIMP  for  multiscale  simulations  over  FEM. 


(b)  GIMP 

Figure  17  :  Multiscale  simulation  of  nanoindentation 
with  five  levels  of  refinement 


5  Conclusions 

The  following  are  specific  conclusions  based  on  the  re¬ 
sults  of  this  investigation; 

1 .  A  2D  generalized  interpolation  materia!  point  (GIMP) 
method  has  been  implemented  to  address  problems,  such 
as  particle  flying-off  and  alternating  stress  sign  associ¬ 
ated  with  conventional  MPM  in  case  of  relatively  large 


deformation. 

2,  To  conduct  multiple  length  scale  simulations,  a 
parallel  computing  scheme  has  been  presented  using 
GIMP  under  SAMRAI  parallel  computing  environment 
in  which  multi-level  grids  are  used  for  spatial  and  tem¬ 
poral  refinements. 

3.  A  refinement/coarsening  algorithm,  based  on  material 
points  of  GIMP  in  two  grid  levels,  has  been  developed 
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for  communication  between  neighboring  grid  levels  of 
different  refinements.  With  increase  in  the  refinement 
levels,  as  well  as  decrease  in  the  time  step  increments, 
the  computational  accuracy  is  greatly  improved  in  the  re¬ 
gion  of  interest  while  the  overall  computational  time  is 
reduced.  The  computation  at  each  grid  level  is  performed 
recursively  to  ensure  that  the  refinement  and  coarsening 
are  performed  when  the  two  neighboring  levels  are  syn¬ 
chronized. 

4,  2D  MPM  and  GIMP  were  applied  to  simple  ten¬ 
sion  and  indentation  problems  to  validate  the  GIMP  al¬ 
gorithm.  GIMP  results  agree  very  well  with  FEM  results 
for  these  two  examples  provided  that  the  deformations 
are  small.  The  noise  and  instability  problems  present  in 
conventional  MPM  are  not  observed  in  the  GIMP  simu¬ 
lations. 

5,  As  the  deformation  is  increased,  GIMP  continued  to 
execute  while  FEM  aborted  due  to  element  distortion. 
Also  GIMP  results  are  stable.  Thus  GIMP  is  able  to  han¬ 
dle  relatively  large  deformation  problems. 

6,  For  the  nanoindentation  problem,  a  GIMP  algorithm 
for  the  contact  between  a  rigid  indenter  and  a  deformable 
workpiece  was  developed.  A  reasonably  good  agreement 
between  GIMP  and  FEM  results  was  reached,  validating 
the  contact  algorithm  presented  in  this  investigation, 

7,  Another  nanoindentation  example  with  multiple 
length  scales  from  a  few  nanometers  to  sub-millimeters 
was  simulated  and  numerical  results  validated  the  paral¬ 
lel  GIMP  computing  with  the  use  of  SAMRAI. 
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Structured  Mesh  Refinement  in  Generalized  Interpolation  Material  Point  (GIMP) 
Method  for  Simulation  of  Dynamic  Problems 

Jin  Ma,  Houghing  Lu,  and  Ranga  Komanduri 1 


Abstract:  The  generalized  interpolation  material  point 
(GIMP)  method,  recently  developed  using  a  C*  continu¬ 
ous  weighting  function,  has  solved  the  numerical  noise 
problem  associated  with  material  points  just  crossing  the 
cell  borders,  so  that  it  is  suitable  for  simulation  of  rela¬ 
tively  large  deformation  problems.  However,  this  method 
typically  uses  a  uniform  mesh  in  computation  when  one 
level  of  material  points  is  used,  thus  limiting  its  effec¬ 
tiveness  in  dealing  with  structures  involving  areas  of 
high  stress  gradients.  In  this  paper,  a  spatial  refinement 
scheme  of  the  structured  grid  for  GIMP  is  presented  for 
simulations  with  highly  localized  stress  gradients,  A  uni¬ 
form  structured  background  grid  is  used  in  each  refine- 
mem  zone  for  interpolation  in  GIMP  for  ease  of  generat¬ 
ing  and  duplicating  structured  grid  in  parallel  processing. 
The  concept  of  influence  zone  for  the  background  node 
and  transitional  node  is  introduced  for  the  mesh  size  tran¬ 
sition,  The  grid  shape  function  for  the  transitional  node 
is  modified  accordingly,  whereas  the  computation  of  the 
weighting  function  in  GIMP  remains  the  same.  Two 
other  issues  are  also  addressed  to  improve  the  GIMP 
method.  The  displacement  boundary  conditions  are  in¬ 
troduced  into  the  discretization  of  the  momentum  conser¬ 
vation  equation  in  GIMP,  and  a  method  is  implemented  to 
track  the  deformation  of  the  material  particles  by  tracking 
the  position  of  the  particle  corners  to  resolve  the  prob¬ 
lem  of  artificial  separation  of  material  particles  in  GIMP 
simulations.  Numerical  simulations  of  several  problems, 
such  as  tension,  indentation,  stress  concentration  and 
stress  distribution  near  a  crack  (mode  I  crack  problem) 
are  presented  to  verify  this  refinement  scheme. 
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1  Introduction 

The  material  point  method  (MPM)  uses  a  collection  of 
material  points,  mathematically  represented  by  Dirac 
delta  functions  to  represent  a  materia!  continuum  (Sul- 
sky,  Zhou,  and  Schreyer  (1995);  Hu  and  Chen  (2003); 
Gull  key  and  Weiss  (2003)),  A  spatially  fixed  background 
grid,  and  interpolation  between  grid  nodes  and  mate¬ 
rial  points  are  introduced  to  track  physical  variables  car¬ 
ried  by  the  material  points  in  the  Lagrangian  descrip¬ 
tion.  Field  equations  are  solved  on  the  background  grid 
in  the  Eulerian  description.  Physical  variables  are  inter¬ 
polated  from  the  solutions  on  the  background  grids  to 
material  points  back  and  forth  for  solution  and  convec¬ 
tion  of  physical  variables.  In  general,  the  isoparametric 
shape  functions,  same  as  those  used  in  the  finite  element 
method  (F^I),  are  used.  As  the  MPM  simulation  is  in¬ 
dependent  of  the  background  grid,  a  structured  grid  is 
usually  employed  for  purposes  of  simplicity.  The  move¬ 
ment  of  the  materia]  points  represents  the  deformation  of 
the  continuum,  MPM  has  been  demonstrated  to  be  capa¬ 
ble  of  handling  large  deformations  in  a  natural  way  (Sul- 
sky,  Zhou,  and  Schreyer  ( 1995)).  However,  primarily  due 
to  the  discontinuity  of  the  gradient  of  the  interpolation 
function  at  the  borders  of  the  neighboring  cells,  artificial 
noise  can  be  introduced  when  the  material  points  move 
just  across  the  grid  cell  boundaries,  leading  to  simulation 
instability  for  MPM,  The  generalized  interpolation  ma¬ 
terial  point  (GIMP)  method,  introduced  by  Bardenhagen 
and  Kober  (2004)  can  resolve  this  problem.  In  GIMP 
a  C1  continuous  interpolation  function  is  used  and  each 
material  point/p  article  occupies  a  region.  GIMP  has  been 
demonstrated  to  be  stable  and  capable  of  handling  rela¬ 
tively  large  deformations  (Ma,  Lu,  Wang,  Roy,  Homung, 
Wissink,  and  Komanduri  (2005)), 

The  current  MPM  typically  uses  a  uniform  background 
mesh  for  solving  the  field  equations.  However,  this  is 
not  efficient  when  stress  gradients  are  high  such  as  stress 
concentrations  in  a  plate  with  a  hole,  or  the  stress  field  of 
a  workpiece  under  indentation.  In  contrast,  transitional 
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mesh  is  effective  in  solving  problems  involving  rapidly 
varying  stress  in  an  area.  Wang,  Karuppiah,  Lu,  Roy, 
and  Komanduri  (2005)  have  presented  a  method  using 
an  irregular  background  mesh  to  deal  with  problems  in¬ 
volving  rapidly  varying  stress,  such  as  stress  field  near  a 
crack.  However,  this  approach  does  not  use  regular  struc¬ 
tured  background  mesh  so  that  mesh  generation  encoun¬ 
ters  the  same  difficulty  as  FEMT  and  leads  to  the  loss  of 
some  advantage  of  MPM  on  the  ease  of  generating  mesh 
for  a  complex  problem. 

The  use  of  structured  grid  in  GIMP  has  facilitated  the 
implementation  of  GIMP  in  parallel  processing.  A  re¬ 
finement  scheme  based  on  splitting  and  merging  material 
particles  was  proposed  by  Tan  and  Naim  (2002).  Re¬ 
cently,  a  multilevel  refinement  algorithm  has  been  devel¬ 
oped  for  parallel  processing  using  the  structured  adap¬ 
tive  mesh  refinement  application  infrastructure  (SAM- 
RAI)  (Homtmg  and  Kohn  (2002);  Ma,  LuT  Wang,  Roy, 
Homung,  Wissink  and  Komanduri  {2005}).  The  compu¬ 
tational  domain  was  divided  into  multiple  nested  levels 
of  refinement.  Each  grid  level  is  uniform  but  has  a  differ¬ 
ent  cell  size.  Smaller  materia!  particles  and  smaller  cell 
sizes  are  used  in  each  finer  level.  Two  neighboring  lev¬ 
els  are  connected  by  overlapped  material  particles  of  the 
same  size  and  data  communication  between  levels  is  per¬ 
formed  at  predefined  intervals.  However,  the  refinement 
through  material  panicles  requires  extra  communication 
and  simulation  time.  In  this  paper,  a  refinement  for  GIMP 
based  on  the  transitional  grid  nodes  is  developed.  This 
refinement  is  natural  and  does  not  involve  extra  simula¬ 
tion  time.  Moreover,  the  refined  grid  remains  uniformly 
structured  in  each  refinement  region. 

While  the  problem  associated  with  artificial  noise  has 
been  resolved  with  the  use  of  GIMP  method,  it  has  been 
observed  recently  that  material  separation  could  occur  if 
the  deformation  of  the  material  particles  was  not  tracked, 
Guilkey  (2005).  Tracking  the  deformation  of  material 
particles  properly  in  GIMP  is  necessary  especially  when 
the  material  particles  are  stretched.  In  this  paper,  an  ap¬ 
proach  is  developed  for  tracking  the  particle  deformation 
to  resolve  material  point  separation  problem.  This  pa¬ 
per  focuses  on  the  refinement  scheme  for  structured  grid. 
Several  numerical  problems,  such  as  tension,  indenta¬ 
tion,  stress  concentration  and  stress  distribution  near  a 
crack  (mode  l  crack  problem)  were  simulated  to  verify 
this  refinement  algorithm,  as  well  as  to  demonstrate  the 
effectiveness  of  tracking  particle  deformations. 


2  GIMP 

For  the  purpose  of  completeness,  the  basic  equations  in 
GIMP  (Bardenhagen  and  Kober  (2004))  are  summarized 
here.  In  dynamic  simulations,  the  mass  and  momentum 
conservation  equations  are  given  by 

^  +  pV .  v  =  0,  and  (1) 

at 

pa  =  Va-j-b  in  £2,  (2) 

where  p  is  the  material  density,  a  is  the  acceleration,  a 
and  b  are  the  Cauchy  stress  and  body  force  density,  re¬ 
spectively.  The  displacement  and  traction  boundary  con¬ 
ditions  are  given  as 

u  =  uondQu,  (3) 


X  =  Xon  dQx, 


(4) 


where  C  C  d£l  and  d£2„  n<Klx  =  0.  In  vari¬ 

ational  form,  the  momentum  conservation  equation  can 
be  written  as 


J pa-Svdx 
n 

=  J  Va^Sve/x-f-  j  b*5 vdx-a  J  (ti  -u)  -5vdx,  (5) 


where  5v  is  an  admissible  velocity  field,  a  is  a  penalty 
parameter  we  introduce  herein  to  impose  the  essential 
boundary  conditions  and  a>>  I,  Atluri  and  Zhu  (1998), 
Atluri  and  Zhu  (2000).  Applying  the  chain  rule,  V  a  ■ 
5v  —  V  ■  (a  ■  5v)  -  a  :  V8v,  and  the  divergence  theorem, 
Eq.  (5)  can  be  written  as 

J  pa  ■  5 vdx  +  j  a  :  VSvdx 
n  a 

-  fhbvdx  +  j  x  b\dS 

n  an, 

+  J  xu  6\dS~a  J  (U“U)  -5v^5  (6) 

where  xu  is  the  resultant  traction  due  to  the  displacement 
boundary  condition  on  In  GIMP,  the  domain  Q.  is 
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discretized  into  a  collection  of  material  particles,  with  Qp 
as  the  domain  of  particle  p.  The  physical  quantities,  such 
as  the  mass,  stress  and  momentum  can  be  defined  for 
each  particle-  For  example,  the  momentum  for  particle 
p  can  be  expressed  as  pp  =  /  p(x)v(x)xP(x)t/x,  where 

ft/* 

v(x)  is  the  velocity  and  x^(x)  is  the  particle  characteris¬ 
tic  function.  The  momentum  conservation  equation  can 
be  discretized  as 

X,  [  ^  •  Svt/x  +  X  /  °pXp  :Svtfx 
p  nnnr  p  p  nrx*p 

“X  [  mp^p  b  *  5vdx  +  ^  /  x  ■  b\dx 

Pnrap  p  pdntnop 

+  X  /  ttt*5vrfS-o^  I  (u-u)-$\dS  (7) 

/J  dQitnaIt  p  an^nn,, 

where  Vp  =  /  £p{x)tfx  is  the  particle  volume.  Intro- 
nnQr 

during  a  background  grid  and  the  grid  shape  function 

$<*) 

5j(x)  that  satisfies  partition  of  unity  X  =  I*  the  admis- 

i 

sible  velocity  field  can  be  represented  by  the  grid  nodal 

data  as  5v  =  X8v*Sf(x).  Without  the  loss  of  generality, 
i 

we  take  u  in  Eq.  (7)  to  be  the  displacement  of  the  bound¬ 
ary  particles  at  the  current  time  step  and  tw  —  oFnu  where 
n„  is  the  unit  outward  normal  to  The  momentum 
conservation,  Eq.  (7),  can  eventually  be  written  for  each 
node  I  as 

pf  =ff»+f*  +  fT  +  f?,  (8) 


given  as 
2 


5ip  =  V  /  Xp(x)^(xMx' 


£ina„ 


(10) 


The  weighting  function  in  GIMP  is  C1  continuous  and 
satisfies  partition  of  unity*  The  momentum  conservation, 
Eq*  (8)t  can  be  solved  at  each  node  to  update  the  nodal 
momentum,  acceleration,  and  velocity.  These  updated 
nodal  quantities  can  be  interpolated  to  the  material  par¬ 
ticles  to  update  the  particles,  as  given  by  Bardenhagen 
and  Kober  (2004)*  It  may  be  noted  that  the  mass  of  each 
material  particle  does  not  change,  so  that  the  mass  con¬ 
servation  equation  is  satisfied  automatically. 

In  the  discretization  of  the  weak  form  of  the  momentum 
conservation  equation,  a  background  grid  is  used*  How¬ 
ever,  the  computation  is  independent  of  the  grid  from  one 
increment  to  another.  Hence  a  spatially  fixed  structured 
grid  can  be  used  for  convenience.  In  the  background  grid, 
no  nodal  connectivity  is  required  and  the  integration  is 
never  performed  on  the  element  domain.  Similar  char¬ 
acteristics  have  been  reported  for  other  meshless  meth¬ 
ods,  such  as  the  meshless  local  Petrov-Galerkin  (MLPG) 
method,  Atluri  and  Shen  (2002). 

For  a  uniform  structured  grid,  the  grid  shape  function  in 
3D  is  defined  as  the  product  of  three  nodal  tent  functions 
(Bardenhagen  and  Kober  (2004)) 

*(x)  =  SfWSf{4sf(z),  do 


in  which  the  nodal  tent  functions  are  in  the  same  form, 
e.g*, 


where  the  time  rate  of  change  of  nodal  momen¬ 
tum  pj  —  X^pp/Ar,  the  nodal  internal  force  vec- 

p  _ 

tor  fj™  —  the  nodal  body  force  vector 

p 

ff  =  X™^b*V  anc^  [he  traction  force  vector  — 

p 

X  /  TSj(x)dS.  fy  is  the  force  vector  induced  by  the 
essential  boundary  condition  given  by 

*!'  =  X  /  ffPnuSi{x)dS-a£  J  (uP~u)Sj(x)clS. 

p  cKi.nti,,  p  3Q„nn„ 

(9) 

Sin  is  weighting  fund  ion  between  particle  p  and  node  i 


m = 


(0  x—Xj<  —Lx 

1+0* -Xi)jLx  -Lx<x-Xj<  0 

\-{x-Xi)jLx  0  <x-Xi<Lx 
t  0  ^  x  ~~  X( 


(12) 


Fig.  I  shows  one  2D  grid  cell  with  four  nodes.  In  this 
paper,  the  panicle  characteristic  function  of  the  material 
particle  located  at  (xp,  y,,)  is  taken  as 

x$0-jgM#(y).  (i3) 

where  XpM  =  H[x  -  (x,,  - /,)]  -  H\x-  (xp  +  /,)]  and  H 
denotes  the  step  function* 
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Figure  1  ;  2D  representation  of  a  panicle  and  a  grid  cell 


3  Structured  mesh  refinement 

In  this  section*  a  refinement  scheme  for  a  structured  mesh 
in  GIMP  is  described.  Since  GIMP  shares  some  char¬ 
acteristics  with  meshless  methods,  we  expect  the  GIMP 
method  to  have  h  convergence  in  simulation  (Allan  and 
Shen  (2002)}.  The  momentum  conservation  equation  is 
essentially  solved  at  each  node  (see  Eq.  (8)).  Therefore, 
the  number  of  equations  to  be  solved  is  the  same  as  the 
number  of  nodes.  Finer  grid  and  smaller  material  par¬ 
ticles  will  lead  to  more  accurate  results*  In  some  simu¬ 
lations*  high  stress  gradients  exist  in  small  regions*  For 
instance*  in  indentation  with  a  sharp  tip*  the  stress  gra¬ 
dient  is  high  in  the  workpiece  beneath  the  indenter  tip. 
In  simulation  of  fracture  problems*  the  stress  gradient  at 
the  crack  tip  is  high  and  of  particular  interest.  Conse¬ 
quently*  finer  grid  is  needed  for  these  regions;  but  away 
from  these  regions,  a  coarser  grid  can  be  used  to  reduce 
the  computational  cost*  In  conclusion,  a  uniform  grid  can 
be  either  too  computationally  expensive  if  it  is  too  fine* 
or  inaccurate*  if  it  is  coarse.  A  non-uniform  grid  with  re¬ 
finement  can  provide  accurate  results  while  minimizing 
the  overall  computational  lime. 

Grid  refinement  should  maintain  the  same  characteristics 
of  the  structured  grid  as  much  as  possible*  in  order  to 
replicate  the  grid  generation  in  parallel  processing.  The 
proposed  refinement  scheme  is  illustrated  in  Fig.  2  with 
one  particle  per  cell  assigned.  The  material  particles  that 
fill  each  cell  are  square  in  nature  but  denoted  as  circles 
for  clarity.  To  understand  this  grid*  one  can  consider  that 
there  are  two  overlapped  structured  grids.  The  coarse 
grid  covers  a  rectangular  region  from  (0,  0)  to  (8*  6}  and 
the  fine  grid  covers  a  region  from  (2*  2)  to  (6,  6).  For 
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Figure  2  :  Refinement  of  structured  grid  with  a  refine¬ 
ment  ratio  of  two 


each  grid*  the  shape  function  can  be  evaluated  from  Eq, 
( 1 2),  To  be  consistent  with  any  other  general  refinement, 
it  is  required  that  the  region  of  the  fine  grid  to  be  smaller 
than  the  coarse  grid. 

When  these  two  grids  are  merged  into  one,  the  shape 
function  and  the  weighting  function  for  the  nodes  at 
the  boundary  of  the  finer  grid,  for  example*  the  nodes 
at  (2*  2)  and  (2*  3)  should  be  changed.  These  nodes 
are  called  transition  nodes.  To  facilitate  the  computa¬ 
tion  of  the  interpolation  function*  we  define  an  influ¬ 
ence  zone  for  each  node,  denoted  as  [L~  *  L*  *  ,  L+]  in 

2D  or  \L~  ,  i  +  ,  1  +  T  Lr,  L+\  in  3D.  The  symbols  in 

the  square  bracket  define  the  size  of  the  influence  zone* 
whereas  the  subscript  denotes  the  coordinate  axis  and  the 
superscript  denotes  the  direction  of  the  axis.  For  exam¬ 
ple,  L~  and  L+  represent  the  sizes  in  the  negative  and 
positive X  direction,  respectively.  The  influence  zone  for 
each  node  in  2D  is  rectangular  and  it  extends  to  the  next 
immediate  grid  tine  to  the  left*  right,  bottom  and  top  of 
the  node.  If  no  more  grid  lines  exist  in  any  direction,  such 
as  the  boundary  nodes,  the  size  is  zero  in  that  direction. 
For  example,  in  the  refined  grid  in  Fig.  2,  the  influence 
zones  for  the  nodes  at  (2,  3)  and  (2*  4)  are  both  [2*  I,  1* 
l },  The  influence  zone  for  the  node  at  (2*  2)  is  [2,  1 , 2*  1  ]. 
Based  on  this  definition*  the  influence  zone  for  this  node 
in  the  coarse  grid  is  [2*  2*  2*  2J*  and  in  the  uniformly  fine 
grid  is  [0*  1*0*  !]. 

Based  on  the  influence  zone*  the  nodal  tent  function  in 
each  direction  can  be  modified  as*  for  example,  in  the  X 
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(a)  Influence  zone:  [K  U,  1] 


(b)  Influence  zone:  [I,  0.5,  t.  1] 


Figure  3  :  Effect  of  influence  zone  on  the  weighting  functions 


(a)  No  particle  deformation 


(b)  Actual  particle  deformation 


Figure  4  :  Schematics  showing  overlaps  and  gaps  that  may  occur  when  particle  deformation  is  not  tracked 


direction, 


s;(x)  =  < 


o  x-xj  <  -l; 

i  +  {x - Xj)/L;  -l;  <  x - x,  <  o 

l-{x-x,)/t+  0  <  x  — x,-  <  Lx 
0  Lt  <x— Xj 


(14) 


Eq.  (14)  can  be  substituted  into  the  grid  shape  function 
(Eq.  ( 1 1 )},  and  the  weighting  function  between  the  parti¬ 
cle  p  and  the  node  i  can  be  evaluated  as 


0 

B  <  -L-x  , 

k-a  +  {hJ-a2)f{lL;  ) 

^b 

b<  0 

6-a-(ft3  al)l{2V,  ) 

lb 

a  >0 

fc-0-o3/(2L:)-^/(2L+) 

otherwise 

05) 


where  A  =  x  -  X(  -  lp,  B  =  x  -  xt  +  lp,  a  =max(A,  -L“ ) 
and  b  =min {B,  L*).  When  L~  =  L* ,  Eqs.  (14)  and 
(15)  are  degraded  to  the  cases  for  uniform  grid.  Without 
detailed  proof,  the  grid  shape  function  and  the  weight¬ 
ing  function  still  satisfy  partition  of  unity.  Similarly,  the 
gradient  of  the  modified  weighting  function  can  be  com¬ 
puted. 

It  may  be  noted  that  in  Fig.  2  the  refinement  ratio  is  two, 
i,e+1  the  length  of  a  side  of  a  coarse  cell  is  twice  that  of 
the  fine  cell.  To  maintain  the  convenience  of  the  struc¬ 
tured  grid,  only  integer  refinement  ratio  should  be  used. 
All  nodal  positions  can  be  computed  from  the  domain 
of  each  grid  and  the  cell  sizes.  The  proposed  refinement 
scheme  can  be  applied  to  any  integer  refinement  ratio  and 
for  multiple  times  for  successive  refinements.  As  an  ex¬ 
ample,  the  weighting  function  between  a  panicle  of  size 
0.5  xQ.5  and  a  node  at  (0, 0)  with  an  influence  zone  of  [  1, 
h  I*  1  ]  is  shown  in  Fig,  3(a).  The  particle  is  on  the  X 
Y  plane  and  the  weighting  function  is  computed  at  each 
particle  position.  For  comparison,  the  influence  zone  is 
changed  to  [l,  0.5,  1,1],  representing  a  transitional  node, 
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Figure  5  :  Simulation  showing  separation  when  the  particle  deformation  is  tracked  by  strain 


while  other  conditions  are  the  same.  The  weighting  func¬ 
tion  for  this  case  is  plotted  in  Fig.  3(b).  It  can  be  seen  that 
the  weighting  function  for  the  transitional  node  is  still  C1 
continuous. 

4  Numerical  simulations 

4 /  Tracking  particle  deformation 

Prior  to  presenting  the  results  of  structured  mesh  re¬ 
finement,  tracking  particle  deformation  is  addressed  first 
since  this  is  necessary  in  later  simulations  to  achieve  ac- 
curacy.  The  material  particles  are  initialized  into  regular 
shapes,  normally  square  and  cube  for  2D  and  3D  simu¬ 
lations,  respectively.  All  the  physical  quantities  in  a  par¬ 
ticle  domain  are  considered  to  be  uniform.  The  shape 
of  the  panicle  changes  during  deformation.  So,  it  is  im¬ 
portant  to  track  the  deformation  of  each  particle.  Fig.  4 
illustrates  the  deformation  of  the  particles  in  2D  when 
the  particles  are  stretched  in  the  X-direction.  If  the  par¬ 
ticle  deformation  is  not  tracked,  gaps  will  form  between 
neighboring  particles  in  the  X -direct ion,  as  shown  in  Fig. 
4  (a).  Due  to  Poisson's  effect,  there  will  be  overlap¬ 
ping  between  panicles  in  the  Y-direction,  if  the  particles 
do  not  follow  the  deformation  of  the  materials  properly. 
When  the  stretch  and  gaps  arc  large  enough,  the  particles 
would  he  separated.  Fig.  4  (b)  shows  the  correct  defor¬ 


mation  in  which  contiguous  particles  remain  contiguous 
after  deformation. 

o 


Figure  6  :  Velocity  field  of  a  continuum  region  (the  ar¬ 
rows  represent  both  direction  and  magnitude) 

To  track  the  particle  deformation,  a  convenient  approach 
is  to  calculate  the  deformed  particle  shape  based  on  strain 
history.  Since  the  linear  strain  increment  is  computed 
in  GIMP,  the  effectiveness  and  validity  of  tracking  parti¬ 
cle  deformation  based  on  strain  would  be  limited  to  rel¬ 
atively  small  deformations.  As  an  example.  Fig,  5  (a) 
shows  the  simulation  of  a  uniaxial  tension  problem  under 
plane  strain  conditions.  The  material  is  silicon.  Its  mass 
density  is  2.71  g/cm3,.  Young's  modulus  175.8  GPa,  and 
Poisson’s  ratio  0,28,  The  background  grid  size  is  0.002 
x  0.002  mm2.  One  particle  per  cell  is  assigned  initially 
and  the  time  step  is  0.02  ns  ( l  ns  =  10  9  s).  The  applied 
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(b)  FEM 


<J  :  95QOO  BDODO  75000  70  000  65000  60000  55000  50000 
Figure  7  :  GIMP  results  with  tracking  deformation  of  comers  and  their  comparison  with  FEM 


pressure  increases  linearly  with  time  from  0  to  1 0  ns  and 
is  then  maintained  constant,  as  shown  in  Fig-  5  (b).  The 
elongation  in  the  X -direction  of  the  particle  is  computed 
as  { I  +  where  /“is  the  initial  length  of  the  particles. 
Separation  of  the  particles  occurred  at  ^25%  strain  at  6 
ns  before  the  full  pressure  was  applied,  as  shown  in  Fig. 
5  (c),  Similar  problems  have  been  reported  by  Guilkey 
(2005). 
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Tracking  particle  deformation  by.  strain  could  be  more 
effective  if  nonlinear  strain  is  used  in  the  GIMP  method. 
However,  recovery  of  the  deformed  shape  based  on  non¬ 
linear  strain  involves  additional  complications.  Based 
on  the  GIMP  algorithm,  there  is  another  convenient  ap¬ 
proach  to  track  the  particle  deformation.  Numerically, 
the  displacement  and  velocity  of  each  particle  in  GIMP 
are  computed  at  the  center  of  the  particle  to  represent  the 
entire  particle  domain.  However,  in  reality,  the  velocity 
and  deformation  at  the  comers  of  a  particle  can  be  dif¬ 
ferent  from  the  center.  Fig.  6  shows  four  2D  contiguous 
particles  sharing  one  common  comer  point  at  the  middle. 
This  comer  point  should  have  unique  displacement  and 
velocity.  As  a  result,  it  is  helpful  to  track  the  displace¬ 
ment  and  velocity  of  each  comer  to  track  the  particle  de¬ 
formation. 

It  is  not  difficult  to  compute  the  velocity  of  the  particle 
comer  given  its  location.  It  is  computed  from  the  inter¬ 
polation  from  the  background  grid,  similar  to  the  center 
of  the  particle.  For  a  2D  particle,  in  addition  to  updating 
the  position  of  the  center  of  the  particle,  the  positions  of 
the  four  corners  are  updated  at  each  increment.  To  com¬ 
pute  the  weighting  function  for  the  corners,  a  fictitious 


Figure  8  :  Two-dimensional  indentation  simulation 
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Figure  9  :  Comparison  of  the  stress  distributions  at  different  levels  of  refinements  in  force  indentation 


size  can  be  assigned  to  each  corner.  Numerical  simu¬ 
lation  shows  that  the  result  is  not  sensitive  to  this  size 
in  the  range  of  10%  to  80%  of  the  initial  particle  size* 
The  new  panicle  shape  can  be  obtained  by  connecting  the 
four  comers  with  straight  lines*  In  order  to  avoid  numeri¬ 
cal  integration  of  the  interpolation  function,  it  is  assumed 
that  the  deformed  material  particle  shape  is  rectangular 
with  edges  parallel  to  the  coordinate  axes.  The  size  of 
the  rectangle  is*  therefore*  determined  from  the  extent  of 
the  corners.  As  will  be  demonstrated  later  in  this  section* 
this  assumption  does  not  introduce  any  significant  error 
while  it  can  greatly  improve  the  efficiency  of  the  GIMP 
algorithm* 


Using  this  approach  to  track  the  particle  deformation,  the 
problem  in  Fig,  5  (a)  was  simulated  again  with  GIMP 
and  the  results  at  20  ns  are  plotted  in  Fig.  7  (a).  No  sep¬ 
aration  of  particles  was  seen  during  the  entire  simulation 
up  to  50%  strain.  It  is  noted  that  each  material  particle 
is  plotted  as  a  square  of  the  same  size  and  the  particle 
deformation  is  not  shown  due  to  software  limitations  on 
visualization.  For  comparison*  the  same  problem  is  sim¬ 
ulated  using  FEM  (Abaqus/Expiicit)  and  the  FE  result  is 
shown  in  Fig.  7  (b).  It  can  be  seen  that  these  two  sets  of 
results  agree  reasonably  well  with  each  other;  the  maxi¬ 
mum  difference  in  maximum  tensile  stress  is  ^8%. 
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Figure  10  ;  Comparison  of  displacement  history  with 
FE 


Figure  11  :  A  simple  tension  problem  with  displace¬ 
ment  boundary  conditions 


Figure  12  :  Comparisons  of  the  displacement  and  stress 


4.2  Indentation  problem 

To  verify  the  refinement  algorithm,  a  2D  indentation 
problem  was  simulated.  Pressure  is  applied  at  the  top 
of  the  workpiece,  as  shown  in  Fig.  8.  The  dashed  lines 
indicate  the  borders  of  refinement  levels.  The  work  mate¬ 
rial  is  silicon  with  the  properties  the  same  as  those  given 
in  the  previous  section.  Due  to  symmetry  with  respect  to 
the  Y-axis,  only  half  of  the  workpiece  is  modeled.  The 
size  of  the  model  is  0.027  x  0  04  mm2.  The  pressure  in¬ 
creases  linearly  with  time  from  t  =  0  -  30  ns  and  is  then 
kept  constant  at  60  GPa, 

Several  simulations  were  performed  under  different  set¬ 
tings  for  the  purpose  of  comparison.  In  the  first  simu¬ 


lation,  a  uniform  grid  with  a  cell  size  of  0,001  x  0.001 
mm2  is  used  and  the  time  step  is  20  ps  (1  ps  -  10^12 
s).  The  stress  distribution  in  the  workpiece  at  t  =  20  ns 
is  shown  in  Fig.  9(a).  In  this  figure,  the  units  of  length 
and  stress  are  mm  and  MPa,  respectively.  In  the  second 
simulation,  as  indicated  in  Fig.  9(h),  two  levels  of  re¬ 
finements  are  used  with  the  refinement  ratio  to  be  2,  The 
cell  lengths  are  0.002  mm  and  0.001  mm  for  the  first  and 
second  levels,  respectively.  The  fine  level  covers  a  rect¬ 
angular  area  of  the  workpiece  from  (0,  0.02)  to  (0.02, 
0.04).  The  grid  is  fixed  in  space;  the  material  particles 
initially  in  the  fine  region  move  to  the  coarse  region  dur¬ 
ing  deformation.  As  shown  in  Fig.  9(b),  some  fine  mate¬ 
rial  particles  have  moved  below  the  line  Y  -  0.02  mm.  It 


85 


222  Copyri^/u©  2006  Tech  Science  Press 


CMES,  vol.  12,  no  X  pp.21.T227,  2006 


may  be  noted  that  in  Fig.  9  the  material  particles  are  plot¬ 
ted  as  squares  corresponding  to  their  initial  sizes*  Gaps 
between  panicles  are  intentionally  shown  to  depict  the 
particle  sizes* 

Three  levels  of  refinements  are  used  in  the  third  simu- 
lation  of  the  same  problem,  in  this  simulation*  the  time 
step  is  10  ps  and  the  results  are  shown  in  Fig.  9(c).  The 
stresses  agree  very  well  with  the  previous  two  simula¬ 
tions.  Fig.  9{d)  shows  the  result  when  the  particle  de¬ 
formation  is  not  tracked.  A  severe  material  separation 
was  observed  at  t  =  36  ns,  as  indicated  by  the  arrow. 
Additionally*  the  displacement  history  of  the  panicle  in 
the  middle  of  the  top  surface  for  the  third  simulations  is 
shown  in  Fig*  10.  The  same  result  of  the  integration 
point  of  the  element  in  the  middle  of  the  top  surface  in 
FE  simulation  using  ABAQUS/Explicit  is  also  shown  in 
Fig.  10  for  comparison*  It  can  be  seen  that  the  displace¬ 
ment  compares  well  with  FE  up  to  t  =  —20  ns.  After  this 
time,  FE  simulation  aborted  due  to  mesh  distortion.  This 
demonstrates  the  capability  of  GIMP  using  a  grid  with 
structured  refinement  in  handling  large  deformations. 

It  may  he  noted  that  if  the  exterior  comers  of  the  sur¬ 
face  particles  arc  tracked  from  the  nodal  interpolations 
in  the  same  way  as  the  interior  comers*  simulations  tend 
to  become  unstable  due  to  erroneous  surface  corner  dis¬ 
placements.  This  problem  was  observed  to  be  strictly 
and  consistently  associated  with  the  particles  with  exter¬ 
nal  tractions  applied.  It  is  caused  by  insufficient  nodal 
interpolation.  To  eliminate  this  problem,  the  exterior  cor¬ 
ners  of  the  surface  particles  were  tracked  by  strain  only, 
as  used  in  these  simulations. 

43  Validation  of  the  displacement  boundary  condi¬ 
tion 

We  next  describe  the  results  on  the  verification  of  the 
displacement  boundary  conditions.  For  dynamic  simu¬ 
lations*  an  artificial  damping  may  be  introduced.  With 
damping,  the  nodal  momentum  can  be  updated  as 

Aft=*(lf,  +  ff  +  l?+iy-^)Ar1  (16) 

where  d  is  the  artificial  damping  coefficient* 

A  rectangular  slab  is  fixed  on  the  left  and  a  displacement 
boundary  condition  is  applied  on  the  right,  as  shown  in 
Fig.  1 1.  The  material  is  silicon  and  its  properties  are 
given  in  the  previous  section.  The  cell  size  in  this  simu¬ 
lation  is  0,5  x  0.5  mm2  and  four  particles  are  assigned  to 


each  cell  initially.  The  time  step  is  5  ns.  The  prescribed 
displacement  increases  linearly  with  time  to  0,5  mm  at  t 
=  1 0  /*s*  which  corresponds  to  a  velocity  of  50  m/s*  and 
remains  constant  thereafter.  This  problem  is  simulated 
in  FE  using  Abaqus/Explidt  for  comparison*  The  dis¬ 
placement  in  the  X-direction*  U*,  for  a  particle  initially 
centered  at  (2.625*  1*625)  as  a  function  of  time  is  plot¬ 
ted  in  Fig*  12  (a)*  It  can  be  seen  that  without  damping* 
the  vibrations  of  displacement  from  both  FEM  and  GIMP 
simulation  are  in  phase  before  t  -  14  ^s*  but  out  of  phase 
afterwards.  The  dashed  line  represents  the  steady  state 
displacement  at  this  point.  It  can  be  seen  that  when  an 
artificial  damping  of  1 06  s — 1  is  used*  the  GIMP  solution 
converges  quickly.  The  error  of  the  converged  displace¬ 
ment  is  0,46%.  Fig.  1 2  (b)  shows  the  comparison  of  the 
normal  stress  in  the  X-direction.  Good  comparison  be¬ 
tween  GIMP  and  FE  has  been  obtained.  With  damping, 
the  GIMP  solution  converges  to  the  analytical  solution 
for  static  simulations* 

4 A  Stress  concentration  problem 


Figure  13  ;  A  plate  with  a  circular  hole  subjected  to  ten¬ 
sion 

Fig,  13  shows  a  copper  plate  (60  x  60  mm2)  with  a 
central  hole  (4  mm  diameter)  subjected  to  a  distributed 
load.  This  problem  is  simulated  using  GIMP  as  a  dy¬ 
namic  problem  with  a  damping  factor  of  1000  and 
three  levels  of  refinement.  The  cell  sizes  at  these  three 
levels  are  1.0  mm,  0*5  mm.  and  0.25  mm,  respectively. 
One  particle  is  assigned  to  each  cell  not  adjacent  to  the 
hole  initially.  The  cells  close  to  the  circular  hole  are  as¬ 
signed  25  particles  each  to  model  the  circular  edge  more 
accurately  with  the  use  of  square  particles.  It  may  be 
noted  that  all  the  particles  occupy  square  areas  initially. 
The  time  step  is  10  ns  and  the  applied  distributed  load 
intensity  p  =  10  MPa.  The  stress  distribution  after  4000 
steps  is  shown  in  Fig*  14  (a)  and  the  area  dose  to  the 
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(a)  Overall 


(b)  Magnified 


Figure  14  :  Normal  stress  in  the  X-direction  with  p  =  10  MPa 
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Figure  IS  :  Normalized  tangential  stress  along  the  cir¬ 
cumference  of  the  hole 


hole  {bottom  left  comer  in  Fig.  14  (a))  is  magnified  in 
Fig.  14  (b).  The  normal  stress  of  the  particle  at  the  top  of 
the  hole  is  30,7  MPa  when  the  applied  tension  is  10  MPa. 
This  gives  a  stress  concentration  factor  of  3,07.  The  tan¬ 
gential  stress  of  the  particles  along  the  hole  circumfer¬ 
ence,  normalized  by  the  applied  pressure,  is  plotted  in 
Fig.  15  in  comparison  with  the  theoretical  value,  Pilkey 
(1997).  A  good  agreement  between  the  numerical  sim¬ 
ulation  and  theoretical  value  is  obtained.  This  demon¬ 
strates  that  the  GIMP  refinement  algorithm  is  effective 


for  problems  involving  significant  stress  gradients.  Fur¬ 
thermore,  with  the  use  of  small  square  panicles,  GIMP  is 
capable  of  modeling  curved  surfaces. 

4.5  StatQ  stress  intensity  factor 

We  next  use  the  GIMP  refinement  algorithm  to  determine 
the  stress  field  and  stress  intensity  factor  for  a  mode  I 
crack  problem  to  determine  the  capability  of  the  GIMP 
refinement  algorithm  in  simulation  of  stress  distribution 
near  a  crack.  Guo  and  Nairn  (2003)  have  successfully 
extended  the  MPM  method  to  compute  stress  distribu¬ 
tion  in  a  plate  with  explicit  cracks  using  multiple  nodal 
fields  along  the  crack  surface.  The  physical  quantities  of 
material  particles  on  each  side  of  the  crack  were  inter¬ 
polated  using  variables  in  the  field  on  that  side  of  crack 
surface.  In  their  simulation,  a  uniform  mesh  was  used. 
Since  the  stress  gradient  at  the  crack  tip  is  very  high,  a  re¬ 
fined  mesh  near  the  crack  tip  and  a  coarse  mesh  in  the  far 
field  should  lead  to  savings  in  computational  time  while 
maintaining  the  same  accuracy  with  the  use  of  a  uniform 
fine  mesh.  The  fracture  problem  is  thus  an  appropriate 
problem  to  evaluate  the  refined  GIMP  algorithm.  For  this 
purpose,  we  model  the  same  fracture  problem  by  Guo 
and  Naim  (2003)  using  MPM,  that  is,  a  double  cantilever 
beam  (DCB)  with  a  crack  as  shown  in  Fig,  16.  In  the 
area  close  to  the  crack  tip,  finer  meshes  are  used,  while 
in  the  area  far  away  from  the  crack  tip,  coarse  meshes 
are  used.  The  thickness  of  the  plate  is  l  mm,  thereby 
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Figure  16  :  Geometry  of  a  double  cantilever  beam  with 

a  crack 


justifying  a  plane  stress  condition  for  this  problem.  The 
material  of  the  DCB  is  considered  to  be  homogeneous, 
isotropic,  and  linearly  elastic.  Its  density.  Young's  mod¬ 
ulus,  and  Poisson's  ratio  are  1500  kg/m3,  2300  OPa,  and 
0.33,  respectively-  The  applied  force  is  F  =  4  x  10-4  N 
and  this  results  in  a  mode  I  crack  problem.  The  static 
stress  intensity  factor  for  the  DCB  can  be  calculated  us¬ 
ing  the  following  equation  (Kanninen  1973) 


„  ^xF{*+3H/2) 

K>  =  V3 — 77v2 — • 


(17) 
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Figure  17  :  Material  points  and  background  grid  around 
the  crack  tip 


This  problem  has  been  simulated  using  MPM  with  uni¬ 
form  grids  of  three  sizes,  4  mm,  2  mm,  and  1  mm  (Guo, 
Naim  (2003)).  They  have  computed  the  energy  release 
rate  and  determined  the  stress  intensity  factor  from  J* 
integral.  Their  results  indicate  that  the  stress  intensity 
factor  determined  from  finer  grid  is  more  accurate  and 
closer  to  the  theoretical  value. 


Figure  18  :  Computed  stress  intensity  factor  as  a 
function  of  time 


In  the  mesh  refinement  GIMP  algorithm  used  in  this 
study,  the  energy  release  rate  was  computed  using  the 
virtual  crack  closure  technique  (Rybicki  and  Kanninen 
(1977);  Wang,  Karupptah,  Lu,  Roy,  and  Komanduri 
(2005)),  The  energy  released  during  an  infinitesimal 
crack  growth  of  A a  is  assumed  to  be  the  energy  required 
to  dose  the  crack  to  its  initial  size.  Hence,  the  energy 
release  rate  G  is  determined  by 

G  —  lim  r-—  [  (Gn)  Auda.  (18) 

jw-o  2A a  h 

where  n  is  the  unit  normal  vector  at  the  crack  surface. 
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For  the  2D  mode  I  crack  in  the  X-direction, 

] 

G{  =  lim  — —  /  Oyy-Auyda.  (19) 

2Aa  J o 

In  GIMP,  the  energy  release  rate  can  be  computed  as 

g,  =  2 Tto^y-ty'  (20) 

where  the  superscripts  1  and  2  denote  the  two  material 
particles  immediately  to  the  left  of  the  crack  tip  as  shown 
in  Fig,  17,  Aa  is  the  X  distance  between  particle  1  and 
the  crack  tip,  t  is  the  thickness  of  the  beam*  Fttp  is  the 
nodal  force  to  hold  the  crack  lip  together  (Rybicki  and 
Kanninen  (1977))  and  is  computed  as  the  crack  tip  nodal 
force  from  one  side  of  the  crack  in  this  simulation. 

The  mode  I  stress  intensity  factor  for  the  static  crack  is 
given  by 

K,  =  VGE.  (21) 

GIMP  simulations  were  carried  out  using  four  material 
points  per  cell.  The  time  step  is  0.1  /us,  and  a  damp¬ 
ing  coefficient  of  4000  s”1  is  used  to  allow  the  results  to 
converge  to  the  static  data.  The  computed  stress  intensity 
factors  using  uniform  grids  of  4  mm  and  1  mm,  res  pec- 
lively,  are  plotted  in  Fig.  18  (a)  and  the  theoretical  value 
calculated  from  Eq.  (17)  is  also  shown  for  comparison. 
For  simulation  with  refinement,  two  levels  of  refinement 
were  used,  i.e.,  2  mm  grid  size  for  the  coarse  level  and  l 
mm  grid  size  for  the  fine  level,  and  the  extent  of  the  fine 
level  is  indicated  by  the  dashed  square  in  Fig.  16.  The 
computed  stress  intensity  factor  using  the  refinement  al¬ 
gorithm  is  also  plotted  in  Fig.  18  (a).  It  is  seen  that  the 
results  from  two  levels  of  refinement  arc  identical  to  the 
results  from  one  uniform  fine  level  with  1  mm  cell  size. 
Moreover,  the  simulation  time  using  the  refinement  algo¬ 
rithm  is  38%  shorter  than  that  of  one  uniform  fine  level. 
The  computed  stress  intensity  factor  became  even  closer 
to  the  theoretical  value  when  a  third  level  of  refinement 
was  added  for  the  crack  tip  as  shown  in  Fig.  18  (b).  For 
the  case  of  using  uniform  grid  of  2  mm,  the  computed 
stress  intensity  factor  using  GIMP  and  MPM  (Guo  and 
Naim  2003)  are  plotted  in  Fig.  19.  The  MPM  results 
were  computed  using  a  damping  factor  of  1 QOO  s” 1 ,  and 
therefore,  more  oscillations  can  be  seen  as  expected. 

Fig.  20  shows  the  distribution  of  av  at  t  =  2.5  ms  when 
the  force  applied  on  the  beam  was  changed  to  F  =  4  N, 


Figure  19  :  Comparison  of  the  stress  intensity  factor 
with  MPM  results 


10000  times  of  the  previous  value,  with  other  parameters 
the  same.  In  this  simulation,  three  levels  of  refinements 
with  cell  sizes  of  l  mm,  0.5  mm  and  0.25  mm,  were  used 
and  the  different  density  of  material  points  in  each  level 
can  be  clearly  seen  in  the  figure.  The  computed  stress 
intensity  factor,  scaled  by  10000  times,  still  compares 
well  with  the  theoretical  value.  It  is  noted  that  deforma¬ 
tion  near  the  crack  surfaces  has  caused  matenal  points 
crossing  cell  boundaries,  a  situation  where  MPM  would 
give  numerical  noise  such  as  alternating  stress  signs.  De¬ 
spite  the  extent  of  the  deformation,  and  material  points 
crossing  cell  boundaries,  GIMP  method  with  the  use  of 
particle  deformation  tracking  still  gives  correct  results, 
further  verifying  the  structured  refinement  methods  de¬ 
veloped  herein, 

5  Conclusions 

I.  A  spatial  refinement  scheme  for  a  structured  grid 
was  developed  by  adding  transitional  nodes  and 
by  changing  the  influence  zone  of  the  transition 
nodes  in  GIMP.  The  influence  zone  is  square  for 
uniform  grid  nodes  and  rectangular  for  transitional 
nodes.  This  influence  zone  affects  the  computation 
of  the  nodal  shape  function.  The  computation  of  the 
weighting  function  remains  the  same  as  for  the  uni¬ 
form  grid.  The  refinement  scheme  can  be  applied 
successively  and  the  refined  grid  remains  structural 
in  each  refinement  level,  i.e.,  every  node  can  he  de- 
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termined  by  the  extent  of  the  grid  level  and  cell  size. 
The  refinement  scheme  was  implemented  and  sev¬ 
eral  problems  such  as  tension,  indentation,  stress 
concentration,  and  stress  distribution  near  a  crack 
(mode  I  crack  problem)  were  modeled  to  demon¬ 
strate  its  effectiveness  and  accuracy.  A  good  agree¬ 
ment  has  been  obtained  between  numerical  and  the¬ 
oretical  results,  indicating  the  validity  of  the  struc¬ 
tured  mesh  refinement  for  GIMP  scheme, 

2.  The  GIMP  algorithm  has  also  been  extended  to  in¬ 
clude  the  displacement  boundary  condition,  based 
on  the  approach  used  in  the  meshless  local  Petrov- 
Galerkin  (MLPG)  method,  At  luff  and  Zhu  (2000), 
A  penalty  parameter  is  used  to  impose  the  displace¬ 
ment  boundary  condition  and  a  nodal  force  vector 
hecause  the  displacement  boundary  condition  is  in¬ 
troduced  to  the  nodal  momentum  governing  equa¬ 
tion,  A  uniaxial  tension  problem  with  constant 
pulling  velocity  was  simulated  to  verify  the  dis¬ 
placement  boundary  condition. 

3.  A  method  to  track  the  material  particle  deformation 
was  developed  and  verified  in  one  example.  When 
the  particle  deformation  is  not  tracked,  artificial  sep¬ 
aration  was  observed  when  the  particle  strain  in¬ 
creases  to  a  certain  level  In  tensile  simulations, 
when  the  normal  strain  is  ^25%,  material  particles 
tend  to  separate  from  the  body.  Our  approach  tracks 
the  displacement  of  each  comer  of  the  material  par¬ 
ticles.  Since  neighboring  particles  share  corners, 
no  separation  would  occur  during  deformation  us¬ 
ing  this  approach. 
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Multiscale  Simulation  Using  Generalized  Interpolation  Materia)  Point 
(GIMP)  Method  and  Molecular  Dynamics  (MD) 

J,  Ma1,  H.  Lu1,  B.  Wang1,  R,  Hornung3,  A.  Wissink2,  and  R.  Komanduri1 
Abstract 

A  new  method  for  multiscale  simulation  bridging  two  scales,  namely  the 
continuum  scale  using  the  generalized  interpolation  material  point  (GIMP)  method  and 
the  atomistic  scale  using  the  molecular  dynamics  (MD),  is  presented  and  validated  in  2D. 
The  atomistic  strain  from  the  molecular  dynamics  simulation  is  determined  through 
interpolation  of  displacement  field  into  Eulerian  background  grid  using  the  same 
generalized  interpolation  functions  as  in  GIMP  method.  The  atomistic  strain  as  defined  is 
consistent  with  that  as  determined  from  virial  theorem  for  interior  points,  but  provide 
more  reasonably  stable  values  at  the  boundary  of  MD  region  or  in  the  handshaking  region 
between  MD-GIMP  so  that  physical  quantities,  such  as  the  displacement,  internal  forces, 
and  energy  density  are  compatible  in  (he  handshaking  region.  A  continuum  material  point 
in  the  continuum  is  split  into  smaller  material  points  usmg  multi-level  refinement  until  it 
has  nearly  reached  the  atom  size  to  couple  with  atoms  in  the  MD  region.  Consequently, 
coupling  between  GIMP  and  MD  is  achieved  by  using  compatible  deformation  and  force 
fields  in  the  transition  region  between  GIMP  and  MD.  The  coupling  algorithm  is 
implemented  in  the  Structural  Adaptive  Mesh  Refinement  Application  Infrastructure 
(SAMRAI)  for  parallel  processing.  Both  mode  I  and  mode  11  crack  propagation  problems 
arc  simulated  using  the  coupling  algorithm.  The  stress  field  near  the  crack  tip  was 
validated  by  comparing  the  results  from  coupled  simulation  with  purely  GIMP 
simulations  of  the  same  model.  Coupled  simulation  results  were  also  compared  with  pure 
MD  simulation  results.  In  both  cases,  a  very  good  agreement  was  obtained. 

Keywords:  incremental  atomic  strain,  muitiscale  simulation,  GIMP,  MD,  coupling, 
SAMRAI 
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1.  Introduction 

Simulations  at  the  continuum  and  atomistic  levels  are  often  used  to  determine  the 
material  deformation  and  failure  at  their  respective  length  scales.  The  atomistic 
simulations  are  usually  performed  at  very  small  length  scales  (?)  and  time  scales  (?) 
whereas  continuum  simulations  are  performed  at  larger  scales  (?).  While  the  atomistic 
scale  simulation  can  reveal  the  fundamental  aspects  of  deformation  and  failure  behavior, 
it  cannot  be  scaled  up  for  larger  length  scales  due  to  limitation  in  computing  power. 
However,  for  problems,  such  as  nanoindentation  and  crack  propagation,  a  combined 
atomistic  and  continuum  simulations  may  provide  as  much  information  as  a  purely 
atomistic  simulation  would  provide.  This  is  especially  so  when  MD  simulations  are  used 
for  regions  encompassing  high  stress  gradient  zones,  such  as  the  crack  propagation  zone 
(Kohlhoff,  Gumbsch  and  Fischmeister,  1991)  and  the  workpiece  just  underneath  the 
indenter  as  in  nanoindentation  (Shilkrot,  Miller  and  Curtin,  2002)  whereas  continuum 
region  is  used  for  the  rest  of  material. 

Several  techniques  have  been  proposed  for  simulation  bridging  two  or  more  scales 
(Kohlhoff.  Gumbsch  and  Fischmeister,  1991;  Shilkrot.  Miller  and  Curtin,  2002;  Curtin 
and  Miller,  2003;  Shiari,  Miller  and  Curtin,  2005;  Raffi-Tabar,  Hua  and  Cross,  1998).  A 
transition  region,  or  handshake  region,  overlapped  by  the  outer  boundary  of  the  atomistic 
region  and  the  inner  boundary  of  the  continuum  region,  is  usually  used  to  transfer  the 
physical  quantities  (Kohlhoff,  Gumbsch  and  Fischmeister,  1991;  Shilkrot,  Miller  and 
Curtin,  2002;  Curtin  and  Miller,  2003).  At  the  continuum  level,  in  multiscale  simulation, 
the  finite  element  method  (FEM)  is  often  used.  Waves  with  the  wavelength  larger  than 
the  element  size  can  be  transmitted  from  MD  into  continuum,  and  on  the  other  hand, 
waves  with  wavelength  smaller  than  the  element  size  get  reflected  artificially  (Raffi- 
Tabbar,  Hua  and  Cross,  1998).  Efforts  were  made  on  minimizing  the  wave  reflection 
while  enforcing  the  displacement  and  force  continuity,  as  well  as  energy  conservation  in 
the  handshake  region.  An  the  intermediate  scale,  namely,  the  mesoscale  simulation,  was 
also  used  in  bridging  the  continuum  and  atomistic  simulations  (Raffi-Tabar,  Hua  and 
Cross,  1998).  Recently,  nonlinear  deformation  has  been  considered  in  coupling.  The 
coupled  atomistic/continuum  discrete  dislocation  (CADD)  method  has  demonstrated 
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superior  capability  in  detecting  dislocations  from  MD  simulation  and  passing  them  to  the 
continuum  region,  as  well  as  handling  explicit  material  defects  and  inclusions  (Shiari, 
Miller  and  Curtin,  2005),  For  2D  simulations,  the  dislocations  generated  in  the  atomistic 
region  can  propagate  into  the  continuum  region  by  defining  continuum  elements  with  slip 
planes  in  front  of  the  transition  region  to  detect  dislocations  (Shilkrot,  Miller  and  Curtin, 
2002;  Curtin  and  Miller,  2003). 

Other  simulation  techniques,  such  as  the  meshless  local  Petrov-Galerkin  (MLPG) 
method  (Shcn  and  Atluri,  2005),  Green’s  function  method  (Cai,  Koning,  Bulatov  and  Yip, 
2000),  and  coarse-grained  molecular  dynamics  (Rudd,  Broughton,  1998)  are  also  used  at 
the  continuum  level.  These  techniques  show  advantages  in  heat  transfer,  stress 
compatibility  or  minimizing  wave  reflection  for  the  coupling  between  atomistic  and 
continuum  levels.  Mesh  distortion  associated  with  FEM  is  an  issue  for  large  nonlinear 
deformations  and  dislocations.  Recently,  the  material  point  method  (MPM)  {Sulsky, 
Zhou  and  Schreyer,  1995;  Sulsky  and  Schreyer,  1996)  was  introduced  for  dynamic 
simulations  and  the  general  interpolation  material  point  (GIMP)  method  (Bardenhagen 
and  Kober,  2004)  was  presented  with  improved  simulation  stability.  The  MPM  and 
GIMP  use  material  points  to  represent  a  material  continuum  and  utilizes  both  Lagrangian 
description  (for  material  points  carrying  physical  variables)  and  Eulerian  description  (for 
convection  of  physical  variables  and  solution  of  Field  equations).  Because  the 
GIMP/M  PM  methods  do  not  use  a  fixed  body  mesh  so  that  mesh  entanglement  associated 
with  highly  nonlinear  deformations  can  be  prevented.  GIMP  and  MPM  have 
demonstrated  the  capabilities  in  the  computation  of  stresses  and  strains  in  metal  forming, 
dynamic  fracture  (Guo  and  Naim,  2004),  and  impact  problems  (Sulsky  and  Shreyer, 
1996).  Recently,  GIMP  has  been  successfully  implemented  for  2D  simulations  covering 
length  scales  from  nanometers  to  millimeters  (Ma,  Wang,  Lu,  Roy,  Hornung,  Wissink 
and  Komanduri,  2004)  using  multi-level  refinements  with  parallel  computing  in  the 
framework  of  Structured  Adaptive  Mesh  Refinement  Infrastructure  (SAMRAf)  (Hornung 
and  Kohn,  2002).  The  GIMP  refinement  technique  proposed  here  has  made  it  possible  for 
coupling  with  MD  so  that  the  advantages  of  G1MP/MPM  methods  can  be  fully  carried 
over  to  mulliscale  simulations.  This  paper  will  describe  the  GIMP/MD  coupling 
techniques,  and  demonstrate  it  in  the  Mode  I  and  Mode  II  crack  propagation  problems. 
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One  aspect  of  the  problems  associated  with  coupling  is  the  conversion  of  physical 
quantities  computed  from  MD  region  to  continuum  region,  and  vice  versa.  MD 
simulation  gives  atom  positions,  velocities,  etc.,  so  that  information  can  be  passed  over 
back  and  forth  between  two  different  regions.  The  stresses  and  strains  defined  in  a 
material  continuum  can  be  computed  from  MD  simulation  using  various  approaches  (Gao, 
Huang  and  Abraham,  2001;  Buehler,  Abraham  and  Gao,  2003;  Zimmermann,  1999; 
Horstemeyer  and  Baskes,  2000).  A  typical,  and  still  most  effective  approach  thus  far  is 
based  on  the  virial  theorem  (Marc  and  McMillan,  1985;  Zimmermann,  Webb,  Hoyt, 
Jones,  Klein.  Bammann,  2004).  However,  it  has  some  limitations  such  as  not  being  able 
to  compute  stress/strain  at  the  boundary  of  the  MD  region.  Some  new  approaches  have 
been  developed  to  calculate  the  atomic  stress  more  accurately.  For  example,  Zhou  and 
McDowell  (2004)  defined  an  equivalent  continuum  for  the  atomistic  system  with 
conserved  momenta,  work  rates,  and  mass.  The  atomic  stresses  are  then  calculated  from 
the  internal  forces  and  lattice  constants.  Hardly  (see  Zimmermann  et  al.  (2004)  for  a 
summary  of  Hardly’s  approach)  computed  the  atomic  stresses  from  the  contribution  of 
nearby  atoms  with  spatial  averaging  using  a  localization  function. 

The  atomistic  strain  measures  the  deformation  of  the  atomic  lattice,  it  is  also  used 
in  combined  atomistic  and  continuum  studies  of  material  behavior,  Buehler,  Gao  and 
Huang  (2004)  investigated  the  stress  and  strain  fields  near  the  crack  tip  in  MD 
simulations  and  revealed  that  continuum  mechanics  can  be  used  at  nanoscale  (no 
conditions  stated).  In  another  study,  Bueher,  Abraham  and  Gao,  2003  showed  that  in  a 
harmonic  solid,  the  local  hyperelastic  wave  speed  governs  the  crack  speed  of  a  mode-1 
crack  without  branching  when  the  hypcrelastic  zone  approaches  the  energy  length  scale 
near  the  crack  tip. 

This  paper  presents  a  method  for  coupling  MD  with  GIMP  simulations  in  2D.  A 
new  formulation  to  compute  the  atomistic  strain  rate  and  strain  increment  is  developed  by 
computing  the  velocity  gradient  based  on  the  interpolation  of  the  velocity  field  on  a 
background  grid.  Based  on  the  atomic  strain  computation,  an  equivalent  continuum 
system  can  be  defined  for  the  atomic  region  for  the  purpose  of  coupling  MD  with 
continuum  computation  using  the  GIMP  method.  The  coupled  GIMP  and  MD 
simulations  use  the  same  background  mesh.  The  coupling  scheme  is  based  on  the 
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atomistic  strain,  stress,  as  well  as  deformation  while  neglecting  the  heat  transfer.  Both 
mode  1  and  mode  II  crack  propagation  problems  are  simulated  to  demonstrate  the 
coupling  approaches.  The  coupled  simulation  results  are  compared  with  purely  GIMP 
simulation,  and  pure  MD  simulation  for  validation. 

2.  GIMP  and  refinement 

In  the  generalized  interpolation  material  point  method  (GIMP)  (Bardenhagen  and 
Kober,  2004),  a  continuum  is  discretized  into  a  collection  of  material  points.  Each 
material  point  carries  all  the  physical  variables  for  a  fully  defined  problem  in  solid 
mechanics,  such  as  the  position,  mass,  velocity,  stress  and  strain,  etc.  The  material  point 
deforms  to  represent  the  motion  of  the  continuum  while  the  mass  at  each  material  point 
remains  constant.  For  a  dynamic  problem,  using  the  variational  principle,  the  momentum 
conservation  equation  is  given  by 

| pa  ■  Svdx  +  [o  :  VtSW/x  =  jb  ■  Svdx  +  |c  ■  Svdx ,  ( I ) 

n  n  a  cu 

where  p  is  the  mass  density,  a  is  the  acceleration  field,  b  is  the  body  force  density,  o  is 
the  Cauchy  stress  tensor,  Sv  is  an  arbitrary  admissible  velocity  field  and  fi  is  the  current 
configuration  (Bardenhagena  and  Kober,  2004).  To  solve  this  equation,  a  background 
grid  is  introduced  for  interpolation  between  the  material  points  and  the  nodes  on  the 
background  grid  using  a  weighting  function  that  will  be  described  later.  Hence,  Eq.  (I) 
can  be  discretized  and  the  equation  of  motion  is  eventually  solved  at  each  node, 

p,  =f;n*  +  f*+f“',  (2) 

where  the  nodal  momentum,  internal  force  and  body  force  are  obtained  by  summing  the 
contributions  from  each  material  points  p  to  this  node  i  as  * 

P 

p''VS,pVl,  and  fj1  =y^mllbStP  ,  respectively.  mp  and  Vp  are  the  particle 

P  P 

mass  and  volume.  The  external  force  is  given  by  f“'  =  jc5,(x)c/S,  where  c  is  the 

an, 

surface  traction.  Finally,  the  position,  strain  and  stress  of  the  material  points  can  be 
updated  using  the  interpolation  from  surrounding  nodes. 
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For  GIMP  simulations,  in  areas  with  high  stress  gradient,  finer  computational  mesh 
and  smaller  lime  step  should  be  used  to  improve  accuracy.  Both  spatial  and  temporal 
refinements  have  been  introduced  in  a  multilevel  refinement  algorithm  (Ma,  Wang,  Lu, 
Roy,  Homung,  Wissink,  and  Komanduri,  2005)  in  which  the  computational  domain 
consists  of  nested  grid  levels  with  successive  refinement.  Each  finer  level  covers  part  of 
the  next  coarse  level  and  each  level  is  computed  separately  with  its  own  time  increment. 
Smaller  time  increments  are  used  for  finer  levels  and  the  communication  between  two 
grid  levels  are  performed  when  these  two  levels  are  synchronized.  Fig.  1  shows  two 
neighboring  coarse  and  fine  grid  levels  in  2D  GIMP  computations  with  a  refinement  ratio 
of  two.  The  thick  line  represents  the  physical  boundary  of  the  fine  level  with  four  layers 
of  ghost  cells.  Initially,  four  material  points  are  assigned  to  each  cell  at  the  fine  level.  At 
the  coarse  level,  the  portion  overlapped  by  the  fine  level  is  assigned  with  16  material 
points  per  cell.  Hence,  these  material  points  have  the  same  size  and  initial  positions  as 
those  at  the  fine  level.  The  rest  of  the  coarse  level  is  assigned  with  four  material  points 
per  cell.  Two  data  exchange  processes,  i.e.,  refinement  and  coarsening  are. used  in  the 
communication.  Refinement  process  passes  information  from  the  coarse  level  to  the 
immediate  fine  level,  while  coarsening  process  passes  information  from  fine  level  to  the 
next  coarse  level.  In  the  refinement  process,  physical  variables  at  the  fine  material  points 
inside  the  thick  lines  are  copied  directly  to  replace  the  material  points  at  the  coarse  level, 
In  the  coarsening  process,  the  physical  variables  at  coarse  material  points  are  copied  to 
the  ghost  cells  of  the  immediate  fine  level. 


Ghost  particles  of  fine  level 
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Fig.  1 :  Two  neighboring  coarse  and  fine  grid  levels  in  2D  GIMP  computations 
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Using  SAMRAI,  each  grid  level  can  be  divided  into  multiple  patches  for  parallel 
processing.  Each  patch  is  rectangular  and  is  assigned  to  a  processor.  The  patches  are 
overlapped  by  ghost  cells  for  communication.  The  parallel  processing  scheme  will  be 
discussed  in  more  detail  in  later  sections. 


3.  MD  Simulation  and  Atomistic  strain 


Molecular  dynamics  (MD)  simulations  solves  the  motion  of  the  atoms  by 
integrating  the  equations  of  motion  governed  by  Newton’s  second  law  f  =  ma  .  In  general, 
short  range  interaction,  governed  by  an  atomic  potential  function,  among  atoms  within 
the  cutoff  radius  is  considered.  With  the  rapid  increase  of  computing  power,  MD 
simulation  is  becoming  a  powerful  tool  in  simulation  of  material  behavior.  The  MD  code 
used  in  this  paper  is  the  LAMMPS  code  (Plimpton,  1995)  developed  at  the  Sandia 
National  Laboratories. 

To  couple  MD  with  continuum  simulation  using  the  GIMP  method,  we  need  to 
pass  MD  results  to  continuum  simulation.  For  this  purpose,  variables  used  in  MD  must  be 
consistent  with  those  in  the  GIMP  simulation.  The  most  critical  issue  is  passing  the  atom 
velocities,  positions  to  the  continuum  region.  This  involves  the  computation  of 
continuum  strain  fie  is  based  on  the  discrete  atom  deformation  information. 

There  are  numerous  publications  in  recent  years  on  the  computation  of  strain  from 
MD  results  (?).  One  successful  approach  is  the  formulation  to  calculate  the  atomic  virial 
strain  developed  by  Zimmermann,  Webb,  Hoyt,  Jones,  Klein,  and  Bammann  (2004),  In 
this  approach,  the  left  Cauchy-Green  deformation  tensor  for  atom  a  at 
coordi  nates  (jc“  ,  x  ? ,  Jtj )  is  written  as 


b: 


1  A 

7  nl 

A  b*o  nfJb 


(3) 


where  Ax,"*  =  jc"  -  x*  ,  At"*  =  x“  -  x*,  Ra„  is  the  undeformed  distance  between  atoms  a 

and  b,  and  N  is  usually  limited  to  the  nearest  neighbors.  X  is  a  factor  dependent  on  the 
lattice.  X  =  3  for  the  2D  triangular  lattice  if  the  six  nearest  neighboring  atoms  arc  only 
considered  (Zimmermann,  1999).  Therefore,  the  Eulerian  strain  tensor  for  atom  a  is 
found  as 
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«;=}(<*» <4> 

where  Sv  is  the  Kronecker  delta.  The  virial  strain  calculated  above  is  meaningful 

instantaneously  in  time  and  space.  However,  at  locations  where  the  nearest  neighbor  list 
does  not  exist  or  cannot  be  determined,  such  as  at  the  external  boundary,  crack  surfaces 
and  interfaces,  Eq.  (3)  is  invalid.  To  circumvent  this  drawback,  we  propose  an  alternative 
approach  as  described  in  the  next  section  to  calculate  the  atomistic  strain  based  on  the 
strain  rate. 


3.1  Incremental  atomic  strain 

In  continuum  mechanics,  the  strain  is  defined  as  the  gradient  of  the  deformation 
field.  To  construct  the  deformation  field  from  the  MD  simulation,  an  Eulerian 
background  grid,  same  as  that  used  in  GIMP,  is  utilized.  All  the  quantities  defined  at 
atoms,  such  as  the  mass,  velocity,  and  forces  can  be  projected  to  the  background  nodes 
through  interpolation,  While  the  nodal  positions  are  fixed  in  space,  the  nodal  quantities 
can  vary  with  time,  The  velocity  of  atom  a  inside  the  grid  is  written  as 

v(*.,0  =  £s„(*.)V((0, 

I  at 

where  n  is  the  number  of  nodes  and  5„,(x)  is  the  interpolation  function  between  node  i 
and  atom  a.  By  definition,  the  strain  of  atom  a  is  then 

e(x.,0  =  ^  {Vu(xo,0  +  [Vu(x0,/)]r},  (6) 

where  u(xu,r)  is  the  deformation  field,  The  strain  rate  is  given  by 


E(xo,0  =  i  {Vv(xB,0  +  [Vv(xfl,0]r} .  0) 

ft 

The  velocity  gradient  at  atom  a  can  be  expressed  as  Vv(xfl,0  =  ^  V5^(xa  )V.  (/) .  Using 

the  Euler  integration  scheme,  we  calculate  the  strain  at  atom  a  for  the  next  time  step 
through 


l+Af 

fcu 


=  £ 


+  £'+A,A/, 


(8) 
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where 


=  iZ(V511)V'+4'  +  v;ti,V5,n). 

2  j.t 


The  nodal  mass,  force,  and  momentum  are  computed  from  neighboring  atoms  as 


) ,  respectively.  The  updated  nodal 


momentum  and  velocity  are  P/*^  =  P'  +  F,'A/  and  V/*41  =  P'*4'  /  M\ . 

3.2  Interpolation  function 

To  ensure  conservation  of  mass,  momentum,  and  energy  between  the  nodes  and  the 
atoms,  the  interpolation  function  should  satisfy  partition  of  unity.  The  interpolation 
function  S,a(x)  can  be  chosen  to  be  the  isoparametric  shape  functions  used  in  finite 

element  analysis.  However,  the  generalized  interpolation  functions  with  C1  continuity  in 
GIMP  has  shown  better  simulation  stability  when  an  Eulerian  grid  is  used  (Bardenhagen 
and  Kober,  2004;  Ma,  et  al.  2005)  and  it  is  used  to  compute  the  atomic  strain  in  this 
investigation. 

The  generalized  interpolation  function  introduced  by  Bardenhagen  and  Kober 
(2004)  consists  of  two  functions,  i.e.,  the  nodal  shape  function  and  the  particle 
characteristic  function.  Both  the  nodal  shape  function  and  the  material  point  characteristic 
function  can  be  extended  to  3D.  In  three  dimensional  situations,  the  nodal  shape  function, 
for  node  i,  is  given  by 


Sl(x)^S:(x)S^y)S;(z), 


(9) 


and  the  particle  characteristic  function,  for  a  brick  particle  p,  is  given  by 

zHx)zZ(y)z',(*) 


(10) 


where 


0 


l+tx-jc,)/^  -L,<x-x.  <0 

1  -  (je  -  x( )  /  Lt  0  <  x  -  x,  <  Lx 


(ID 


0 


Lx  <x-x, 


and 


Zx„(x)  =  HI*  ~  (*,  -  K )]  -  HI*  ~  (xp  +Oh 


(12) 
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where  H(x)  denotes  the  step  function.  Fig.  2  shows  a  rectangular  grid  cell  with  a  particle 
in  it.  The  particle  position  is  taken  at  the  center  of  the  particle.  It  may  be  noted  that  the 
grid  associated  with  the  generalized  interpolation  function  is  a  structured  grid,  which  is 
convenient  to  construct  and  process. 


Fig.  2:  2D  representation  a  particle  and  a  grid  cell 
Both  the  nodal  shape  function  and  the  particle  characteristic  function  are  a  partition 


of  unity,  i.e.,  £/;j(x)  =  l  and  ^5,(x)  =  l .  The  generalized  interpolation  function  is  a 

p  * 

volume  averaged  weighting  function  between  node  i  and  material  points  p  given  by 
S<p=tt  {^p(x)5y(x)fifx,  (13) 

v  p  non, 

where  Q  is  the  entire  computational  configuration,  is  the  spaces  occupied  by  particle 


p,  V  is  the  current  particle  volume. 

The  atoms  are  generally  regarded  as  spherical  in  shape.  The  characteristic  function 
of  a  sphere  can  also  be  found.  However,  to  simplify  the  computation  of  the  interpolation 
function  and  its  gradient,  we  consider  that  the  atoms  are  cubic  in  shape  with  the  same 
volume  of  a  spherical  atom.  We  also  assume  that  the  shape  and  orientation  of  the  atoms 
do  not  change  in  the  simulation.  Consequently,  Eq,  (13)  is  simplified  as 


r,*l,  !,*'i 

S„  =—^—  jS-(x)dx  J S>(y)dy  \s;{z)dz. 


w.V.  -A 


(14) 


The  final  expression  of  the  interpolation  function  and  its  gradient  can  be  found  in 
Bardenhagena  and  Kober  (2004).  The  interpolation  function  is  plotted  in  Fig.  3.  The 
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vertical  axis  is  the  interpolation  function  and  horizontal  plane  is  the  particle  position.  The 


node  is  at  (0,  0).  The  cell  size  is  1*1  and  the  particle  size  is  0, 5*0.5.  The  interpolation 
Function  is  continuous  and  smooth.  It  may  be  noted  that  the  volume  bounded  by  the 
interpolation  function  and  the  horizontal  plane  is  one. 


Fig.  3:  The  generalized  interpolation  function  in  2D 


3.3  Numerical  validation 

Several  MD  simulations  are  performed  to  calculate  the  atomistic  strain  in  2D.  In 
these  simulations,  the  atomic  potential  chosen  is  the  Lennard-Jones  potential  given  by 


(15) 


6 


with  £■„  =  1 ,0  and  a  =  I .  Based  on  the  unit  convention  in  LAMMPS,  the  units  for  length 
and  energy  in  this  paper  are  er  and  £0,  respectively.  This  potential  gives  the  elastic  wave 
speeds  as  follows:  longitudinal  wave  speed  c,  =8.99,  shear  wave  speed  ct  =  5.19,  and 
the  Rayleigh  wave  speed  cR  =  4.80 .  2D  triangular  lattice  is  used  in  the  simulations.  The 
model  has  40  lattices  in  the  X  direction  and  20  lattices  in  the  Y  direction.  The  size  of  the 
model  is  44*39.  The  atom  is  assumed  to  be  cubic  with  a  volume  of  unity. 
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Fig.  4:  Two  examples  to  calculate  the  atomistic  strains  (a)  Tension  and  (b)  Shear 

The  First  simulation  is  simple  tension  as  shown  in  Fig.  4  (a).  The  model  is 
constrained  in  the  X  direction  on  the  left  while  a  constant  velocity  is  applied  on  two 
layers  lattices  on  the  right.  The  strain  histories  of  the  atom  initially  at  (24.5,  11.2) 
calculated  using  the  virial  formula  and  the  new  incremental  approach  are  plotted  in  Fig.  5 
(a).  It  can  be  seen  that  in  general  the  strains  calculated  from  different  methods  agree 
reasonably  well.  The  virial  strain  shows  more  oscillations  than  the  incremental  strain.  The 
interpolation  between  the  nodes  and  the  atoms  is  also  a  spatial  averaging  process.  Two 
cell  sizes,  2  and  2.5  are  used  in  the  background  grid  to  investigate  the  effect  of  cell  size. 
It  can  be  seen  that  oscillation  of  the  strain  are  smaller  when  the  cell  size  is  bigger  and 
vice  versa  because  the  nodal  quantities  are  interpolated  from  a  bigger  number  of  atoms.  A 
large  content  of  noise,  as  a  result  of  random  thermal  vibration  of  the  atoms,  has  been 
suppressed  due  to  averaging  over  a  number  of  atoms.  For  comparison,  the  strain  from 
continuum  simulation  using  the  finite  element  method  is  also  plotted,  assuming  the  model 
material  to  be  homogeneous,  isotropic,  and  linearly  elastic.  It  is  seen  that  all  simulations 
show  that  the  strain  first  increases  at  time  between  t  =  2.5  (any  unit??)  and  3.  The 
calculated  time  is  t  =  2.8  based  on  the  longitudinal  stress  wave  speed. 

The  second  simulation  is  a  shearing  problem  as  shown  in  Fig.  4  (b).  Constant 
upward  velocity  is  applied  on  the  two  lattices  on  the  right.  The  resultant  shear  strain  at 
(24.5,  1 1,2)  is  plotted  in  Fig.  5  (b)  from  these  simulations.  The  virial  strain  again  shows 
largest  variation,  while  the  strain  defined  herein  shows  much  less  vibration. 
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Fig.  5:  Strain  histories  from  different  simulations 
it  may  be  noted  that  computation  of  the  incremental  atomic  strain  does  not  depend 
on  the  atomic  potential  function  and  the  lattice  structure,  The  deformation  field  is 
recovered  to  the  grid  nodes  using  interpolation.  Interpolation  also  allows  averaging  on  the 
displacement  field  so  that  the  computed  atomic  strain  shows  less  oscillation.  It  is  known 
that  the  viriat  formula  is  not  applicable  to  atoms  on  the  surface,  Zimmermann,  1999.  The 
incremental  formulation  developed  in  this  investigation,  however,  does  not  have  this 
restriction.  The  strains  of  the  atom  at  the  top  surface  (in  the  middle  of  the  model)  from 
three  computations  are  plotted  in  Fig.  6.  It  can  be  seen  that  prediction  of  incremental 
formulation  is  reasonably  in  good  agreement  with  the  finite  element  analysis. 


Fig.  6:  Comparison  of  the  normal  strain  of  a  surface  atom  in  tension 


13 


104 


The  ability  to  compute  the  strains  for  the  boundary  and  surface  atoms  is  essential 
for  coupling  simulations.  For  example,  the  strain  at  the  crack  surface  is  required  in 
coupling.  It  may  be  noted  that  the  virial  formulation  computes  a  transient  strain  so  that  it 
does  not  depend  on  the  strain  history  whereas  the  incremental  strain  proposed  herein  is 
computed  from  the  strain  rate  so  that  it  needs  to  be  computed  at  every  time  increment 
during  the  entire  strain  history  of  interest. 

4.  Coupling  of  GIMP  and  MD  simulations 
4.1  Coupling  scheme 

To  enable  coupling  between  GIMP  and  MD,  the  computational  results  must  be  able 
to  exchange  between  the  atomistic  simulation  from  one  side  and  the  continuum 
simulation  from  the  other.  A  successful  coupling  is  indicated  by  the  seamless  transfer  of 
the  deformation,  internal  forces,  heat,  etc.  at  the  interface  of  two  different  simulation 
regions. 

In  the  coupling  scheme  presented  in  this  paper,  heat  transfer  between  the 
continuum  and  atomistic  regions  are  neglected.  Moreover,  the  transition  region  is 
assumed  to  remain  in  the  linear  elastic  regime  at  this  stage.  Fig.  7  (a)  shows  the  overall 
coupling  model  and  Fig.  7  (b)  shows  details  of  the  transition  zone.  The  MD  region  covers 
a  small  portion  of  the  overall  model  and  the  continuum  region  covers  the  rest.  The 
transition  region,  where  the  communication  between  GIMP  and  MD  simulations  take 
place,  is  divided  into  three  zones,  the  inner  zone,  the  incommunicado  zone  and  the  outer 
zone,  as  shown  in  Fig.  7  (b). 

In  the  inner  zone,  the  information  is  passed  from  MD  to  GIMP.  The  atomic  strain  is 
computed  using  the  incremental  scheme  presented  in  Sections  3. 1  and  3.3.  The  velocity 
and  strain  rate  at  the  material  points  overlapped  within  the  atomistic  region  are  computed 
using  Eqs.  (5)  and  (7)  by  replacing  the  atom  positions  with  the  positions  of  the  material 
points.  Hence,  the  stresses  of  these  points  are  computed  based  on  continuum  theories. 
Consequently,  the  material  points  inside  the  atomistic  region  can  be  updated  by  the 
atomistic  information.  These  material  points  can  participate  in  the  overall  continuum 
computation  to  provide  information  for  the  rest  of  the  continuum  region.  The  innermost 
rectangle  in  Fig.  7  (a)  represents  the  region  in  which  all  material  points  are  updated  based 
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on  the  atomistic  information.  These  material  points  are  shown  in  Fig.  7  (b)  as  filled 
squares.  The  unfilled  squares  are  the  material  points  that  are  updated  in  regular 
continuum  computations. 


Incommunicado  zone 
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(a)  Overall  regions  of  a  model 


Inner  zone  Outer  zone 

(b)  Transition  region  with  background  grid 


Fig.  7:  Illustration  of  coupled  GIMP  and  MD  simulations.  The  circles  represent  atoms 
and  squares  (smaller  than  physical  size)  material  points.  The  material  points 
connect  each  other  without  a  gap  to  represent  continuum. 

The  boundary  atoms  in  the  outer  zone,  shown  as  filled  circles  in  Fig.  7,  update  their 
velocities  based  on  the  interpolation  with  the  background  grid  node  using  Eq.  (5).  The 
nodal  velocities  of  the  background  grid  are  computed  from  the  velocities  of  the  material 
points  weighted  by  the  masses  of  the  material  points,  as 


£5^(x,)m,v,(x„,r) 

V,  (/)  =  -* - - - .  (16) 

P 

Once  the  velocities  of  the  boundary  atoms  are  known,  their  displacements  can  be  found 
from  the  time  integration  of  the  velocities.  At  this  point,  the  interactions  with  interior 
atoms  are  fully  defined  by  the  inter-atomic  potentials  as  in  regular  molecular  dynamics 
simulations. 

In  the  proposed  coupling  scheme,  two  different  zones  are  used  for  communication 
in  different  directions,  i.e.  from  GIMP  to  MD  or  from  MD  to  GIMP,  The  incommunicado 
zone  is  the  gap  that  separates  the  inner  and  outer  zones.  This  zone  serves  as  a  buffer  for 
the  communication  between  the  material  points  and  the  atoms.  The  width  of  this  zone  is  3 
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to  4  layers  of  material  points.  In  this  coupling  scheme,  iteration  to  obtain  the  converged 
solution  in  the  transition  zone  is  not  necessary,  as  the  source  of  information  is  guaranteed 
to  be  correct  and  the  communication  in  each  zone  is  unidirectional.  This  approach  not 
only  simplifies  the  coupling  algorithm  but  also  improves  the  stability  and  reduces  the 
computational  cost. 

The  time  step  for  MD  simulation  is  usually  smaller  than  the  time  step  for  GIMP, 
and  can  be  used  as  the  time  step  in  the  coupled  simulation  to  reach  stability  in  coupled 
simulation.  However,  with  the  use  of  small  MD  time  step  as  the  overall  time  step  in 
coupled  simulation,  the  amount  of  computation  is  enormous.  In  this  coupling  algorithm,  a 
temporal  coupling  scheme  is  developed.  A  temporal  factor  N  is  defined  as  the  ratio  of  the 
GIMP  time  step  to  the  MD  time  step,  that  is, 

N  =  *™>Lt  (17) 

dt/HO 

where  dtGMP  =  cLZ-jE/p  is  the  GIMP  time  step.  L  is  the  cell  size;  c  is  a  constant  factor 

(0. 1  in  this  paper).  In  the  coupling,  MD  simulation  advances  N  time  steps  for  each  GIMP 
time  step.  For  this  reason,  it  is  convenient  to  use  an  integer  for  the  temporal  factor.  In 
computation,  the  GIMP  time  step  is  computed  first  from  the  cell  size  and  stress  wave 
speed.  Next,  the  temporal  factor  N  is  rounded  to  an  integer  based  on  Eq.  (17).  Finally,  the 
GIMP  time  step  is  determined  from  dtCIMP  =  N  ■  dtM[> .  The  reduction  in  computation  time 

as  a  result  of  the  temporal  coupling  scheme  is  significant,  as  indicated  by  numerical 
examples  that  will  follow. 

We  next  summarize  the  coupling  scheme  with  the  aid  of  flowchart  Fig.  8,  The 
materia!  points  inside  the  atomistic  domain  are  updated  based  on  the  nodal  information 
interpolated  from  the  atoms,  These  materia!  points  join  the  GIMP  calculation  to  provide 
boundary  conditions,  but  are  not  updated  again.  The  velocity  and  position  of  the 
boundary  atoms  are  updated  based  on  the  nodal  information  interpolated  from  the 
material  points.  These  boundary  atoms  join  the  MD  calculation  to  provide  boundary 
conditions  as  well,  but  the  velocities  are  not  updated  either.  The  concept  of  temporal 
coupling  is  introduced  by  advancing.  MD  simulation  N  steps  for  every  GIMP  step,  in 
order  to  save  computational  time.  It  may  be  noted  that  the  material  points  further  inside 
the  MD  region  can  be  dumped  to  reduce  the  amount  of  computation  since  only  a  few 
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layers  of  material  points  are  needed  to  provide  boundary  conditions  to  the  exterior 
material  points. 

, _ i _ _ 

One  regular  GIMP  step,  but 
*■  do  not  update  the  material 
particles  inside  MD  region 


Update  the  velocity  of  the 
boundary  atoms 

i — ~ 

One  regular  MD  step,  but  do 
not  update  the  velocity  of  the 
boundary  atoms 


Update  the  material  particles 
inside  MD  region 


Fig.  8:  Flow  chart  of  the  coupling  algorithm  for  one  increment 

4.2  Parallel  processing  of  the  coupled  model 

The  parallel  processing  for  the  GIMP  with  multilevel  refinement  using  SAMRAI 
has  been  discussed  in  detail  in  (Ma,  et  al.  2005),  The  coupling  between  GIMP  and  MD  is 

performed  at  the  finest  level  only.  So  the  finest  level  is  always  larger  than  the  MD 

* 

domain.  The  finest  level  is  divided  into  rectangular  regions  called  patches  according  to 
SAMRAI.  Each  patch  is  assigned  to  one  processor  and  the  communication  between 
patches  is  performed  after  each  patch  is  computed.  To  void  data  transferring  in  the 
coupling  process,  the  MD  region  is  decomposed  into  rectangular  region  with  the  same 
size  as  the  patches.  Each  processor  thus  handles  the  same  spatial  region  for  both  GIMP 
and  MD.  Fig.  9  shows  three  levels  of  refinement  for  GIMP  with  MD  coupled  in  the  finest 
level.  The  dashed  lines  divide  the  finest  level  as  well  as  the  MD  domain  into  six 
rectangular  regions.  Each  region  is  processed  by  a  processor  and  the  communication 
between  the  material  points  and  the  atoms  for  coupling  is  performed  within  the  processor. 
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No  data  transferring  between  the  processors  is  necessary.  The  disadvantage  of  this 
approach  is  that  the  load  balancing  of  the  processors  is  more  difficult  to  adjust  due  to  the 
change  of  the  number  of  atoms  and  material  points  in  computation. 


Fig.  9:  Illustration  of  three  GIMP  grid  refinements  and  the  domain  decomposition  for 

coupling 

5.  Numerical  results 

5.1  Multiscale  simulation  of  mode  I  crack 


Fig,  10:  Coupled  GIMP/MD  simulation  of  a  2D  mode-1  edge  crack.  The  dashed  lines  are 
the  boundaries  of  the  atomistic  domain. 

The  coupling  scheme  is  implemented  into  a  multiscale  simulation  of  a  mode  1  crack 
problem  as  shown  in  Fig.  10.  The  overall  model  size  is  1968x!808  and  the  size  of  the 
MD  domain  is  890x600.  Three  levels  of  refinement  in  GIMP  are  applied.  The  finest  level 
has  a  grid  size  of  4  and  the  two  coarser  grid  sizes  are  8  and  16,  respectively.  The  finest 
level  is  coupled  with  MD  and  is  divided  into  six  patches  as  shown  in  Fig.  9.  It  may  be 
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noted  that  the  sizes  of  the  material  points  and  the  grid  cells  determine  the  wavelength  that 
can  be  transmitted  from  MD  simulation  to  the  continuum.  In  this  simulation,  the  high 
frequency  waves  are  not  desirable  in  the  continuum  region.  Hence,  to  save  computational 
time,  the  minimum  grid  size  is  chosen  as  4.  The  total  number  of  atoms  is  497,1 1 1  and  the 
initial  crack  length  is  a  =  498  at  the  height  Y  =  301.  The  applied  pressure  is  0.3  and  the 
time  step  of  the  MD  simulation  is  0,002.  The  temporal  factor  N  is  chosen  as  40,  and 
therefore,  the  GIMP  time  step  for  the  coupling  level  is  0.08.  For  comparison  purposes, 
the  same  problem  is  simulated  using  GIMP  alone  without  coupling,  Fig.  1 1  shows  the 
stress  distribution  in  the  model  at  simulation  time  t  =  64  when  the  stress  wave  just 
traveled  to  the  transition  region.  Fig.  11  (al)  corresponds  to  pure  GIMP  simulation,  and 
Fig.  II  (b  1 )  is  with  coupling.  Due  to  thermal  vibrations  of  the  atoms,  the  stress  in  the  MD 
region  varies  randomly  between  +/-0. 15-0.2  at  stress  free  state.  The  material  points 
further  inside  the  MD  region  can  be  dumped  to  reduce  the  amount  of  computation.  Fig. 
II  (cl )  shows  the  result  of  this  case  in  which  only  3  layers  of  material  points  immediately 
inside  transition  region  are  kept.  It  is  seen  that  the  stress  distribution  is  not  affected  by 
dumping  the  material  points  inside  the  MD  region.  It  was  observed  that  the  computational 
time  was  reduced  by  about  50%  in  this  case.  Close  to  the  crack  tip,  the  stress  distribution 
is  affected  by  dislocations  and  crack  propagation.  In  later  discussions,  only  the  results 
with  material  points  dumped  are  presented. 
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Fig.  1 1 :  Stress  distribution  for  P  =  0  .3 

Fig.  1 1  also  shows  the  same  model  at  time  t  =  112.  At  this  moment  the  stress 
concentration  at  the  crack  tip  has  developed.  The  crack  just  started  to  open.  The  stress 
fields  from  pure  GIMP  simulation  and  coupled  simulation  are  shown  in  Fig.  1 1  (a2)  and 
(b2),  respectively,  The  difference  of  the  maximum  normal  stress  in  the  Y  direction  is  8%. 
The  coupling  region  is  zoomed  in  and  shown  in  Fig.  12,  corresponding  to  the  region 
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indicated  by  the  dashed  rectangle  in  Fig.  1 1  (c2).  The  materia!  points  taking  information 
from  the  atoms  are  overlapped  inside  the  MD  region.  These  material  points  cannot  be 
seen  in  Fig.  12  due  to  the  high  density  of  the  atoms. 
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Fig.  12:  Simulation  model  with  an  edge  crack  with  P=0.3  at  t=208 
It  may  be  noted  that  the  reduction  of  computational  time  due  to  the  temporal 
coupling  with  N  =  40  is  about  75%,  comparing  with  the  same  simulation  with  N  =  1  (no 
temporal  coupling).  The  GIMP  step  involves  regular  GIMP  calculation  and 
communication  between  patches  and  levels  (refinement  and  coarsening).  The  GIMP 
computation  time  is  significantly  larger  than  that  of  an  MD  step  in  this  example. 
Temporal  coupling  has  reduced  the  computational  time  significantly  while  maintaining 
the  accuracy  of  coupling,  as  manifested  in  Fig.  1 1. 

To  study  the  energy  release  rate  of  mode  I  crack  using  the  coupling  algorithm,  a 
tensile  pressure  of  0. 1 5  is  applied  from  t  =  0  to  t  =  2 1 0.  The  crack  started  to  propagate  at  l 
=  -225  and  stopped  at  t  =  -334.  The  external  input  energy,  total  potential  energy  and 
total  energy  from  simulation  are  plotted  in  Fig.  13.  Due  to  the  numerical  damping,  the 
total  energy  decreases  as  time  increases.  The  energy  loss  at  t  =  360  is  20%.  Numerical 
simulation  was  conducted  for  this  same  model  without  damping  and  the  difference 
between  the  input  energy  and  the  system  energy  gain  is  less  than  8%.  The  potential 
energy  is  the  sum  of  the  potential  energy  of  the  atoms  and  the  strain  energy  of  the 
material  points.  It  increases  at  first  when  the  model  is  stretched.  After  the  crack  starts  to 
propagate  the  potential  energy  drops.  The  energy  release  rate  for  this  simulation  is  plotted 
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in  Fig.  14,  as  well  as  the  crack  resistance.  The  energy  release  rate  fluctuates  as  the  crack 
propagates  and  its  magnitude  is  larger  than  the  crack  resistance.  After  t=330  the  energy 
release  rate  drops  below  the  crack  resistance,  leading  the  crack  to  arrest. 


Fig,  13:  Energies  in  the  model  with  P  =  0,15 


Fig.  14:  Energy  release  rate  P  =  0. 15 

5.2  Multiscale  simulation  of  mode  II  crack 

The  same  model  in  Fig.  10  was  simulated  under  mode  II  condition  by  applying 
velocities  on  the  left  side  of  the  model  as  shown  in  Fig.  15.  The  applied  shearing 
velocities  in  the  X  direction,  are  -0.1  and  0. 1  for  the  top  half  and  bottom  half,  respectively. 
Moreover,  to  avoid  the  contact  of  the  initial  crack  surfaces,  a  velocity  in  the  Y  direction  is 
applied  on  the  left  houndary,  0.03  for  the  top  half  and  -0.03  for  the  bottom  half.  The 
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initial  crack  is  horizontal  with  the  tip  located  at  (67,  301).  This  problem  was  simulated 
with  both  the  coupled  model  and  full  MD  model  at  zero  temperature. 


Fig.  15:  Boundary  conditions  of  the  mode  II  crack  problem 
The  velocities  of  the  atoms  in  the  X  direction  from  pure  MD  simulation  are  shown 
in  Fig.  16  (a).  In  this  simulation,  a  crack  was  introduced  to  the  left  of  the  crack  tip  by 
eliminating  the  interaction  of  the  atoms  on  both  sides  of  the  crack.  As  the  velocities  were 
applied  on  the  boundary,  the  predominant  deformation  in  the  model  is  shearing.  The  path 
of  the  shearing  front  was  observed  to  be  straight,  as  an  extension  to  the  initial  crack.  The 
relative  shearing  motion  between  the  two  layers  of  atoms,  which  generate  dislocations, 
occurred  in  the  stick/slip  mode,  as  indicated  by  Fig.  16  (a).  The  blue  spots  indicate  atoms 
moving  to  the  left  at  a  velocity  of  ~0.7  and  relative  velocity  between  the  atoms  on  the  top 
and  bottom  is  -0.8,  Between  the  blue  spots,  the  relative  velocity  is  very  small,  in  the 
range  of  0  to  0,2.  Most  of  the  shear  surfaces  are  sticking  together  and  slip  occurs  at 
isolated  spots. 
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Fig.  16:  Comparison  of  results  from  pure  MD  and  coupled  simulations  at  t  =  1 1 .2 

The  results  from  coupled  simulation  are  plotted  in  Fig.  16  (b)  for  comparison.  It 
can  be  seen  that  the  velocity  contours  agree  well  with  pure  MD  simulations.  The 
stick/slip  pattern  also  matches.  This  further  validates  the  coupling  algorithm.  It  may  be 
noted  that  for  this  size  of  this  model,  there  are  3.3  million  atoms  in  pure  MD  simulation, 
0.5  million  atoms  and  three  levels  of  material  points  in  the  coupled  simulation.  The 
computation  time  and  memory  for  coupled  simulation  are  8%  and  30%  less  than  the  pure 
MD  simulation,  respectively.  As  the  model  grows  bigger,  the  reduction  in  both 
computation  time  and  memory  size  will  be  more  significant. 

In  order  to  track  the  dislocations  in  the  atoms,  the  relative  displacement  of  each 
atom  with  respect  to  each  neighboring  atom  was  computed  and  the  maximum  relative 
displacement  was  recorded  for  the  atom.  Fig.  17  plots  the  maximum  relative 
displacement  of  the  atoms  at  different  simulation  time  when  the  cut-off  value  of  the 
maximum  relative  displacement  is  set  to  0.15,  i.e.,  atoms  with  maximum  relative 
displacement  of  less  than  or  equal  to  0.15  are  not  shown.  With  this  cut  off  value,  all  the 
visible  atoms  are  those  immediately  on  two  sides  of  the  crack  extension,  as  well  as,  those 
with  stick/slip  pattern.  The  left  end  of  each  plot  is  the  crack  tip  and  right  end  is  the  slip 
front.  The  average  propagation  speed  of  the  slip  frontier  is  ~3.5  during  t  =  64  and  t  =  160. 
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Fig.  17:  Dislocation  path  at  different  simulation  time 

6.  Conclusions 

•  A  coupled  atomistic/continuum  simulation  method  is  presented  by  coupling  MD 
simulations  with  GIMP  simulations.  To  enable  coupling,  a  method  for  the  computation  of 
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atomistic  strain,  based  on  the  computation  of  strain  rate,  is  developed.  The  atomic  strain 
rate  is  computed  by  first  interpolating  the  velocities  of  the  atoms  to  a  background 
Eulerian  grid,  then  computing  the  gradient  of  grid  velocities.  The  computed  atomic  strain 
shows  less  vibration  than  that  computed  by  the  virial  theorem  due  to  noise  reduction  as  a 
result  of  the  interpolation  process.  The  generalized  interpolation  function  is  chosen  as  the 
interpolation  function  for  GIMP  and  the  structured  grid  for  the  background  grid.  The 
atomic  stress  is  then  computed  based  on  strain  using  the  isotropic  and  homogeneous 
constitutive  law. 

The  coupling  algorithm  uses  a  common  background  grid  for  MD  and  GIMP.  The 
velocities  of  the  boundary  atoms  are  computed  from  the  grid  velocities,  which  are 
interpolated  from  the  material  points.  The  material  points  inside  the  MD  region  are 
updated  based  on  the  atomic  information  and  these  points  join  the  rest  of  the  material 
points  in  the  GIMP  computation.  This  approach  ensures  the  compatibility  of  deformation 
and  internal  forces  at  the  MD/GIMP  interface  region. 

*  A  multi-level  refinement  scheme  for  GIMP  is  used  to  refine  the  material  points 
close  to  the  atomistic  size.  The  coupling  algorithm  is  implemented  in  the  Structural 
Adaptive  Mesh  Refinement  Application  Infrastructure  (S^JMRAl)  for  parallel  processing. 
The  finest  GIMP  level  is  coupled  with  the  MD  simulation.  The  MD  region  is 
decomposed  into  domains  with  the  same  geometry  as  the  GIMP  patches.  Each  patch  is 
assigned  to  a  processor  and  the  coupling  between  the  atoms  and  material  points  is 
performed  inside  each  processor  without  extra  inter-processor  communication, 

•  The  coupling  algorithm  was  validated  by  comparing  thecoupling  results  with  pure 
GIMP,  and  with  pure  MD  simulation.  A  mode  1  crack  propagation  problem  was 
simulated  using  the  proposed  the  coupling  technique.  The  stress  filed  of  the  coupled 
model  is  validated  by  comparing  with  pure  GIMP  simulations  in  the  elastic  stage.  The 
energy  release  rate  is  computed  and  it  is  found  that  the  crack  arrests  afier  the  energy 
release  rate  is  less  than  the  crack  resistance.  In  mode!  II  loading,  stick/slip  was  observed 
in  the  crack  front  and  the  results  compared  well  with  pure  MD  simulation.  While  the 
current  implementation  is  in  2D,  the  coupling  algorithm  can  he  applied  to  3D, 

The  coupling  algorithm  presented  here  can  be  extended  to  include  discrete 
dynamics  outlined  in  the  CADD  method  (Curtin  and  Miller,  2003;  Shiari,  Miller  and 
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Curtin,  2005).  A  dislocation  detection  band  can  be  placed  inside  the  inner  zone  in  Fig.  7 
which  in  turn  can  be  fed  into  the  continuum  region,  and  vice  versa. 
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Abstract 

Constitutive  laws  are  critical  in  the  investigation  of  mechanical  behavior  of  single  crystal  or 
polycrystaliine  materials  in  applications  spanning  from  microscale  to  macroscale.  In  this 
investigation,  a  combined  FEM  simulation  and  experimental  nanoindentation  approach  was 
taken  to  determine  the  mechanical  behavior  of  single  crystal  copper  incorporating  the 
mesoplastic  constitutive  model.  This  model  was  implemented  in  a  user-defined  subroutine  in 
3D  ABAQUS/Explicit  code,  Nanoindentation  was  modeled  using  the  multiscale  modeling 
technique  involving  mesoplasticity  and  elasticity,  i.cM  mesoplastic  constitutive  model  was  used 
near  the  local  nanoindentation  region  (where  the  dislocations  arc  generated)  while  elastic 
constitutive  model  was  used  in  rest  of  the  region  in  the  workmaterial.  The  meso-mechantcal 
behavior  of  the  crystalline  structure  and  the  effect  of  the  mesoplastic  parameters  on  the 
nanoindentation  load  displacement  relationships  were  investigated  in  the  FEM  analysis. 
Nanoindentation  tests  were  conducted  on  single  crystal  copper  to  determine  load -displace¬ 
ment  relationships.  Appropriate  mesoplastic  parameters  were  determined  by  fitting  the 
simulated  load -displacement  curves  to  the  experimental  data.  The  mesoplastic  model,  with 
appropriate  parameters,  was  then  used  to  determine  the  stress-strain  relationship  of  a  single 
crystal  copper  at  meso-scale.  The  effect  of  indenter  radius  (3,4-1000  pm)  on  material  hardness 
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under  nanoindentation  was  simulated  and  found  to  match  the  experimental  data  for  several 
indenter  radii  (3.4,  10  and  500  pm).  A  comparison  of  the  topographies  of  nanoindentation 
impressions  in  the  experiments  with  FEM  results  showed  a  reasonably  good  agreement. 

©  2005  Elsevier  Ltd.  All  rights  reserved. 

Keywords  Finite  dement  method  (FEM);  Muliiscalc  modeling;  Mesoplasticity;  Single  crystal  copper; 
Nano  indentation 


I.  Introduction 

Material  behavior  at  all  scales  spanning  from  atomistic  to  continuum  involves 
such  processes  as  elastic  deformation,  dislocation  generation  and  multiplication, 
cleavage,  void/m icrocrack  formation  and  its  growth  into  macrocracks,  and  final 
failure.  In  recent  times,  multiscale  modeling  has  received  increasing  attention  due  to 
its  potential  linkage  between  structure-property  relationships  from  nano*  to  macro¬ 
levels  (King  ct  al.,  1995;  Shenoy  et  al.,  1999;  Ogata  et  al.,  2001;  Smith  el  al.,  2001; 
Shilkrot  et  al.,  2002).  Various  computer  simulation  methods  are  used  in  the 
development  of  scaling  laws  as  a  function  of  spatial  and  temporal  variables.  At  the 
subatomic  level,  quantum  mechanics  or  ab  initio  calculations  are  used  to  develop  the 
potential-energy  functions,  With  increase  in  length  (or  time)  scale,  material  behavior 
is  modeled  using  molecular  dynamics  (MD)  and/or  Monte  Carlo  (MC)  simulations, 
then  to  micro  or  mesoplastic,  and  finally  to  continuum  mechanics  (e.g.  finite  element 
method  (FEM)).  Mesoplasticity  serves  as  an  appropriate  formalism  that  bridges  the 
atomistic  mechanisms  of  deformation  and  fracture  to  the  macroscopic  behavior 
(Hartley.  2003).  This  reinforces  the  importance  of  studying  the  material  behavior 
under  mesoscale,  which  is  the  subject  of  this  investigation. 

Ductile  single  crystal  materials  are  of  considerable  interest  in  many  applications 
including  micro-electro-mechanical  systems  (MEMS),  optical,  and  telecommunica¬ 
tion  systems.  Single  crystals  possess  anisotropic  material  properties  that  depend  on 
the  crystal  orientation.  The  conventional  tensile  tests  are  difficult  to  conduct  at  nano- 
and  micro-scales  to  determine  orientation  dependent  behavior.  Indentation  can  be 
an  alternative  approach  to  tension/compression  to  investigate  the  mechanical 
behavior  of  single  crystals,  Flom  and  Komanduri  (2002)  conducted  indentation  and 
scratching  experiments  on  single  crystal  and  potycrystalline  materials.  They  showed 
marked  anisotropies  in  the  slip  systems  surrounding  the  indentations  depending  on 
the  crystal  orientation  and  direction  of  scratching.  Recently,  the  nanoindentation 
technique  (Oliver  and  Pharr,  1992)  has  become  a  useful  tool  to  determine  the 
mechanical  properties  of  very  small  volume  of  materials. 

Lim  ct  al.  (1998)  and  Lim  and  Chaudhri  (1999)  conducted  nanoindentation  tests 
to  investigate  the  effect  of  indenter  load  (l-100mN)  and  radius  of  the  spherical 
indenter  (7,  30,  200,  and  500  pm)  on  the  nano  and  microhardness  of  polycrystalline 
work-hardened  and  annealed  oxygen-free  copper  (OFC)  specimens.  For  the 
annealed  OFC.  they  reported  a  gradual  increase  in  normal  pressure  or  Meyer 
hardness,  Pm  with  a/ R,  where  a  and  R  are  the  radii  of  the  indent  and  the  indenter, 
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respectively.  A  similar  variation  was  observed  in  the  present  investigation.  They  also 
reported  that  Pm  versus  a/R  data  points  for  indenters  of  radii  60,  200,  and  500  pm  all 
fall  on  the  same  curve.  The  equation  for  the  best  fit  is  given  by  Pm  =  0*623(fl//?)a4y*. 
However,  for  indenters  of  smaller  radii  used  {7  and  30  pm),  the  Pm  values  were 
higher  than  for  the  larger  indenters  (60,  200,  and  500  pm).  Also,  among  the  two 
smaller  indenters,  for  a  given  a/R  value,  the  indenter  with  the  smaller  radius  gave 
higher  values  of  normal  pressure  than  for  the  indenter  with  larger  radius.  From  this 
the  authors  concluded  that  the  higher  Meyer  hardness,  Pm  indicates  a  scaling  effect 
for  the  two  smaller  radii  indenters.  For  the  work-hardened  OFC,  they  reported  an 
initial  increase  in  Pm  with  a/R,  reaching  a  steady-state  value  beyond  a/R  of  0,15. 
They  also  reported  the  value  of  normal  pressure,  Pm,  for  a  given  value  a/R,  to  be  the 
same  for  indenters  of  radii  200  and  500  pm,  but  for  7  pm  it  was  only  slightly  higher 
than  from  200  and  500  pm.  From  this,  the  authors  concluded  that  for  nanoindenta¬ 
tions  and  microindentations  of  work-hardened  copper,  no  scaling  effects  were  found 
as  far  as  Pm  versus  a/R  relationship  is  concerned.  It  will  be  shown  in  the  present 
investigation  that  size  effect  is  not  present  for  a  range  of  indenter  radii  used  for  the 
work-hardened  single  crystal  copper  through  measurement  and  simulation.  It  may 
also  be  noted  that  Urn  and  Chaudhri  (1999)  and  Lim  et  ah  (1998)  conducted 
nanoindemation  studies  on  polycrystalline  copper  specimens  (OFC),  both  annealed 
and  work-hardened,  using  a  Berkovich  diamond  indenter  as  well  as  spherical 
indenters  of  different  radii,  namely,  7,  30,  60,  200,  and  500  pm.  In  situations  where 
indenter  radius  was  much  smaller  than  the  grain  size  and  indentation  was  made  in 
the  middle  of  a  grain,  their  results  might  be  representative  for  the  behavior  of  a  single 
crystal  copper.  The  grain  size  of  OFC  was  reported  to  be  in  the  range  of  30-100  pm. 
While  the  100  pm  grains  may  be  adequate  for  the  smaller  radii,  for  example,  7  and 
30  pm  indenters  to  justify  that  indentation  was  made  on  a  single  crystal,  for  radii 
larger  than  this,  the  boundaries  would  influence  the  flow  of  material  so  that 
indentation  can  no  longer  be  considered  to  be  on  a  single  crystal  unless  the  depth  of 
indent  is  kept  small.  Also,  as  the  sample  used  by  Lim  and  Chaudhri  (1999)  and  Lim 
et  al.  (1998)  is  a  polycrystalline  material,  the  orientations  of  grains  change  from  one 
grain  to  another,  albeit  small,  and  it  is  not  known  what  the  orientation  of  the  sample 
is  for  any  grain. 

Since  nanoindentation  tests  can  now-  be  carried  out  with  ease  while  maintaining 
accuracy  as  high  as  a  few  pN  in  load  and  0.1  nm  in  displacement,  material 
characterization  at  sub-micrometer  scale  is  possible.  However,  the  3D  nanoindenta- 
lion  problem  spanning  from  a  sub-micrometer  scale  to  a  millimeter  scale  needs  to  be 
solved  to  determine  how  the  microstructure  (crystal  orientation,  active  slip  systems, 
etc.)  is  linked  to  the  load-displacement  relationships. 

Mesoplastic  constitutive  relationships  have  been  investigated  by  many  researchers, 
including.  Hill  and  Rice  (1972),  and  Asaro  and  Ncedleman  (1985),  Using 
mesoplastic  theory,  Peirce  et  al.  (1982)  analyzed  numerically  the  nonuniform  and 
localized  deformation  in  ductile  single  crystals  subjected  to  tensile  loading,  Huang 
(1991)  developed  a  material  user  subroutine  incorporating  mesoplasticity  in  the 
ABAQUS  implicit  program.  The  theoretical  framework  of  Hill  and  Rice  (1972)  has 
been  implemented  in  the  2D  subroutine.  Kalidindi  et  al,  (1992)  developed  an  implicit 
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time- integration  procedure  mainly  based  on  the  constitutive  model  or  Asuro  and 
Ncedlcmun  (1985).  Kalidindi  et  al  (1992)  simulated  the  load-displacement  curve  in 
plane-strain  forging  of  copper  that  captures  major  features  of  the  experimental  data 
They  reported  jumps  in  the  calculated  curve  although  no  jumps  were  observed  in  the 
experimental  data.  Yoshino  ct  a).  (2001)  applied  mesoplasticity  theory  in  2D  FEM  to 
simulate  the  dislocation  generation  and  propagation  during  indentation  of  single 
crystal  silicon,  Kysar  (2001)  studied  the  crack  growth  along  a  copper/sapphire  bi¬ 
crystal  interface. 

With  increasing  computational  power  at  reducing  costs,  mesoplastic  constitutive 
relation  can  be  implemented  in  some  FEM  codes  to  solve  complex  3D  problems.  For 
example,  Fivcl  ct  al.  (1998)  developed  a  3D  model  to  combine  discrete  dislocations 
with  FEM  for  nanoindentation  simulation  on  single  crystal  copper.  They,  however, 
did  not  include  the  unloading  portion  of  the  load-displacement  curve,  and  did  not 
compare  simulation  results  with  nanoindentation  data,  Wang  ct  al  (2004)  studied  the 
dependence  of  nanoindentation  pile-up  patterns  and  micro-textures  on  the  crystal¬ 
lographic  orientation  on  single  crystal  copper  using  a  conical  indenter  They,  however, 
reported  an  order  of  magnitude  deviation  of  the  numerical  load-displacement  results 
from  the  experimental  data.  Most  of  these  studies  used  the  material  properties  of 
single  crystal  data  obtained  from  tensile  testing  at  macro-scale.  However,  it  is  essential 
to  address  these  parameters  at  the  submicrometer  scale  when  micro-  or  meso-scate 
behavior  is  of  concern.  To  date,  quantitative  agreement  between  mesoplastic/eiastic 
coupling  simulation  and  nanoindentation/micro-tension  results  does  not  exist.  It  is  the 
objective  of  this  investigation  to  provide  this  information. 

In  this  paper,  a  combined  FEM/nanoindentation  approach  was  developed  to 
determine  the  material  properties  of  single  crystal  copper  incorporating  mesoplastic 
constitutive  model  This  model  was  implemented  in  a  user-defined  subroutine  in  3D 
ABAQUS/Explicit.  The  anisotropy  of  single  crystal  copper  was  considered  using  a 
linear  elasticity  model  with  cubic  symmetry  at  the  continuum  scale.  The  3D 
multiscale  FEM  model  developed  in  this  study  involves  both  mesoplasticity  and 
elasticity,  and  is  used  to  simulate  nanoindentation  with  a  spherical  indenter. 
Concurrently,  nanoindentation  tests  were  conducted  on  a  single  crystal  copper. 
Through  fitting  numerical  solution  with  the  experimental  load- displacement  data, 
the  parameters  in  the  mesoscale  constitutive  model  are  determined.  Using  these 
parameters,  the  stress-strain  relationship  can  be  determined  in  the  uniaxial  stress 
state  at  the  sub-micrometer  scale,  or  to  solve  other  mesoplastic  problems.  Using  this 
relationship  we  have  investigated  the  effect  of  indenter  radius  (3.4- 1000  pm)  on 
material  hardness  in  nanoindentation  by  FEM  simulations. 


2,  Mesoplasticity  model 

2.  /.  Theoretical  background 

Motivated  by  experimental  observations,  it  has  been  postulated  that  plastic 
deformation  in  a  face-centered  cubic  (FCC)  crystal  is  confined  to  well-defined  slip 
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systems,  For  an  FCC  lattice,  plastic  deformation  occurs  on  the  slip  systems  defined 
by  the  (1  I  1}  family  of  slip  planes  in  the  (l  1  0)  family  of  slip  directions.  These  12  slip 
systems  govern  the  macroscopic  distortion  of  the  crystal. 

The  kinematics  and  constitutive  relationships  of  mesoplasticity  that  follow  those 
of  Peirce  et  al.  (1983)  are  used  in  this  paper.  The  velocity  gradient  can  be  expressed  as 

L  =  D  +  ft,  (1) 

where  D  is  the  symmetric  part  of  the  rate  of  deformation  and  ft  is  the  skew 
symmetric  spin  tensor.  D  and  ft  are  additively  decomposed  into  elastic  parts  (De  and 
ftc)  and  plastic  parts  (Dp  and  ftp),  respectively,  as  follows: 

D  =  Dc  +  Dp,  (2) 


ft  =  fte+ftr.  (3) 

The  elastic  parts  correspond  to  elastic  deformations  and  lattice  rotations,  whereas 
the  plastic  parts  represent  the  plastic  shear  slips  of  the  lattice.  In  the  context  of  the 
deformation  of  FCC  crystals,  Dp  and  ftp  are  related  to  the  plastic  shear  strain  rate 
on  each  slip  system  as 

12 

dp  =  v  (4) 

i=l 

pi*)  _  ^(b<*)  0  „<*>  +  n«  0  bw),  (5) 


ftp  =  ^2  w(*y*\ 

*=i 


(6) 


w<m  _  i(bm  e  „(*>  _  rtf*)  ®  (7) 

where  n(*}  denotes  the  normal  vector  of  the  slip  plane,  b(*'  is  the  slip  direction,  k  is  the 
number  of  slip  system,  ®  indicates  the  dyadic  product  of  two  tensors  or  vectors,  and 
yik)  is  the  shear  strain  rate. 

When  a  stress  <t  is  applied  to  a  material,  the  resolved  shear  stress  r(**  on  each  slip 
system  can  be  computed  as 


T(*)  _  p(A')0i 


(8) 


■;(A)can  be  expressed  as  a  rate-dependent  power-law  proposed  by  Hutchinson  (1976) 


y(*>  =y„sgn(rw) 


<9) 


where  is  the  rate  sensitivity  exponent,  y0  is  the  shear  strain  rate  at  a  reference 
condition  and  Tq  1  denotes  the  current  shear  strength  of  the  slip  system.  1  develops 
with  the  evolution  of  slips  on  active  slip  systems  due  to  the  accumulation  of 
dislocations  in  a  crystal,  i.e.,  work  hardening.  Since  work  hardening  of  slip  systems 
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depends  on  the  shear  deformation,  variation  of  ]  is  estimated  by 

=  Z>M>’(,1I.  00) 

tel 

where  contains  the  hardening  moduli.  The  diagonal  components  of  In,  represent 
the  self-hardening  moduli,  whereas  the  off  diagonal  components  represent  the 
latent-hardening  moduli.  In  this  study,  anisotropic  hardening  rule  shown  by  Eq.  (1 1) 
is  introduced  in  the  simulation  (Yoshino  ct  al.,  1997). 

hki  =  OL&ki  +  fiftkh  (11) 

where  a  and  ft  are  parameters,  //*,  denotes  the  hardening  matrix  of  dislocation 
interaction  defined  by  Bassani  and  Wu  (1991).  According  to  their  study,  //*,  has  five 
independent  parameters,  taking  into  account,  the  interactions  of  dislocations  in 
various  slip  systems.  In  this  study,  the  hardening  law  has  included  both  self¬ 
hardening  and  latent-hardening  of  each  slip  system  as  well  as  the  interactions  of 
dislocations  among  different  slip  systems. 

Assuming  that  the  crystal  elasticity  is  unaffected  by  slip,  the  elastic  constitutive 
equation  is  given  by 

Je  =  Ce  :  D',  (12) 

where  C  is  the  tensor  of  elastic  moduli  having  the  major  and  minor  symmetries.  For 

v 

cubic  crystals,  there  arc  three  independent  components,  Cn,  Co  and  C44.  <re  is  the 
Jaumann  rate  of  Cauchy  stress  o,  which  is  the  co-rotational  stress  rate  in  terms  of  the 

v 

coordinate  system  that  rotates  with  the  lattice.  is  determined  by 

oe  =  n  +  fip  ■  a  -  a  ■  «p,  (13) 

where  o  is  the  co-rotational  stress  rate  on  the  coordinate  system  that  rotates  with  the 
material.  Combining  Eqs,  (4),  (6),  (12),  and  (13)  leads  to  the  constitutive  equation  of 
mcsoplasticity  in  the  form  of 

12 

l  =  Ce  :  D  -  ^  [(’ W'*’  0-0  Ww)  +  Ce :  ( 1 4) 

k=l 

The  mesoplastic  constitutive  law  expressed  by  Eq.  (14)  to  model  the  meso- 
mcchanieal  behavior  of  single  crystal  copper  is  used  in  this  investigation. 

2.2.  Numerical  implementation 

Based  on  the  theory  presented  in  Section  2.1,  the  mesoplastic  constitutive  model 
is  implemented  in  a  commercial  finite  element  code  (ABAQUS/Explicit),  using  a 
user-defined  subroutine  VUMAT  (ABAQLJS  Manual  2003),  Details  of  the 
numerical  implementation  arc  described  in  the  following.  To  account  for  the 
arbitrary  crystal  orientations,  two  coordinate  systems,  namely,  global  (specimen) 
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and  local  (crystalline)  coordinate  systems  are  introduced  in  the  simulation.  The 
global  coordinate  system  is  fixed  to  the  reference  configuration  while  the  local 
coordinate  system  is  aligned  with  the  crystal  lattice  and  rotates  in  the  same  way  as 
the  lattice.  At  the  beginning  of  each  time  increment,  the  global  strain  increment,  the 
time  increment,  global  stress,  and  the  solution-dependent  state  variables  are  given  to 
the  subroutine  from  the  main  program  in  ABAQUS,  The  user-defined  subroutine 
transforms  the  global  strain  increment  and  stress  into  the  local  system. 

The  mesoplastic  constitutive  calculations  are  performed  in  the  local  coordinate 
system.  In  this  coordinate  system,  the  incremental  stress  is  computed  and 
transformed  into  the  global  system.  At  the  end  of  each  incremental  step,  the  stress 
and  state  variables  are  updated  for  use  in  the  main  program  in  ABAQUS.  Then,  the 
next  load  increment  is  applied  and  a  new  strain  increment  is  generated.  This  loop  is 
repeated  until  computation  at  all  incremental  time  steps  is  completed. 


3,  Nanoindentation  tests 

An  MTS  Nano  Indenter  (XP  system)  was  used  for  nanoindentation  tests  on  single 
crystal  copper  specimen  to  obtain  load-displacement  relationships.  The  system  can 
reach  a  maximum  indentation  depth  of  500  pm  and  a  maximum  load  of  500  mN.  The 
load  resolution  is  50  nN  and  the  displacement  resolution  is0.2nm.  All  nanoindenta¬ 
tion  tests  were  performed  in  air  at  room  temperature  (23  °C).  The  tests  do  not  start 
until  a  thermal  equilibrium  state  is  reached  and  the  drift  of  the  indenter  tip  drops 
below  a  set  value,  typically,  0.05  nm/s.  After  the  indenter  tip  has  made  contact  with 
the  specimen  surface,  the  indentation  load  is  gradually  applied  to  the  surface  of  the 
specimen.  The  indentation  load  and  displacement  are  recorded  simultaneously  at  a 
sampling  rate  of  five  data  points  per  second.  Spherical  diamond  indenter  with  a 
radius  of  3.4  pm  was  used  in  the  nanoindentation  tests. 

A  single  crystal  copper  (10  x  10  x  1  mm)  was  cut  from  an  ingot  using  electrical 
discharge  machining  (EDM).  The  surface  was  chemo-mechanicatly  polished  to  a 
surface  roughness,  Ra  of  9.08  nm.  Each  free  surfaces  in  the  specimen  corresponds  to 
the  (1  00)  direction,  and  the  upper  and  lower  surfaces  are  the  (100)  faces.  Fig.  1  is  a 
schematic  of  the  nanoindentation  setup. 


4.  Combined  FEM/nanoindentatiun  approach 

This  section  describes  the  combined  FEM/nanoindentalion  approach  used  to 
determine  the  material  properties  for  single  crystal  copper  incorporating  the 
mesoplastic  constitutive  model  Various  parameters  in  the  mesoplastic  model  arc 
determined  through  a  fitting  procedure  that  allows  good  correlation  in  the 
load-displacement  relationship  between  the  simulation  results  and  the  nanoindenta¬ 
tion  data.  Nanoindentation  provides  the  load-displacement  curve,  as  well  as  some 
basic  material  parameters,  such  as  Young's  Modulus  and  surface  hardness.  On  the 
other  hand,  for  a  single  crystal  copper,  the  mesoplastic  constitutive  model  and  the 
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Fig.  1 .  Schematic  of  nanoindentation  of  a  single  crystal  copper 


elastic  model  are  used  in  the  finite  element  analysis  to  simulate  the  nanoindentation 
tests  and  to  obtain  numerical  load-displacement  relationships.  If  the  load-displace¬ 
ment  relationship  determined  from  the  numerical  analysis  and  experiment  agrees,  the 
parameters  in  the  mesoplastic  model  would  be  suitable  for  the  single  crystal  copper, 

Wc  next  turn  to  the  description  of  FEM  analysis  of  nanoindentation  on  a  single 
crystal  copper.  The  size  of  the  workpiece  in  the  FEM  model  is  15  x  15  x  10pm. 
Although  this  is  much  smaller  than  the  actual  dimensions  of  the  specimen  used  in 
nanoindentation,  it  is  much  larger  than  the  maximum  indentation  depth  (~300nm). 
So,  the  condition  of  indentation  into  a  half-space  is  justified.  The  3D  finite  element 
mesh  is  shown  in  Fig.  2(a).  There  are  2688  eight-node  brick  elements  and  3208  nodes 
in  this  model.  The  size  of  the  smallest  elements  is  140  nm.  Since  the  primary  focus  in 
the  specimen  is  the  material  directly  underneath  the  indenter,  a  very  fine  mesh  is  used 
near  the  indenter  tip  and  a  coarser  mesh  for  the  remaining  region.  Fig,  2(b)  shows 
the  mesh  in  the  vicinity  of  the  indenter.  The  dimensions  of  the  nanoindemation 
involve  multiple  length  scales,  which  are  reflected  in  the  simulations,  This  is 
implemented  in  the  simulations  using  constitutive  equations  at  two  length  scales, 
namely,  meso-  and  continuum-scales.  Therefore,  the  workpiece  is  partitioned  into 
two  parts  and  the  corresponding  constitutive  relationships  are  assigned.  The 
dimensions  of  the  region  where  mesoplastic  constitutive  model  is  applied  are 
6  x  6  x  10  pm. 

The  nanoindentation,  as  shown  in  Fig.  1,  is  simulated  on  a  single  crystal  copper 
specimen  oriented  in  the  [100]  direction.  The  indenter  used  is  a  rigid  sphere  with  a 
radius.  R  of  3.4  pm.  A  frictionless  contact  pair,  implemented  by  two  contact  surfaces 
with  associated  nodes  between  the  indenter  and  workpiece  is  defined.  All  nodes  on 
the  bottom  surface  of  the  workpiece  are  constrained  along  the  [100]  direction  and  a 
constant  velocity  is  applied  to  simulate  the  indenter  motion  during  both  loading  and 
unloading.  Since  explicit  dynamic  finite  element  analysis  is  conducted,  the  time  step 
increment  is  determined  based  on  the  smallest  element  dimension  in  the  mesh  and  the 
dilatational  wave  speed.  In  this  study,  a  fixed  time  step  increment  60  ps  (lps  = 
10-12s)  is  chosen  as  a  compromise  between  stability  of  the  dynamic  simulation  and 
the  overall  computational  time.  A  total  time  step  increments  of  66,967  in  the 
simulation  are  used,  including  loading,  intermediate  transition,  and  unloading.  The 
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Fig.  2.  FEM  mesh  used  for  the  nanomdcniauon  simulations:  (a)  overview  on  the  mesh;  (b)  zoomed-m 
view  of  localized  indentation  area. 


intermediate  transition  step  entails  ramping  down  the  velocity  to  remove  the 
numerical  discontinuities  in  the  resistance  force  caused  by  the  sudden  change  in  the 
direction  of  the  indenter  motion. 


5.  Results  and  discussion 

5.J<  Nano  indentation 

The  nanoindentation  tests  were  conducted  on  a  single  crystal  copper  specimen. 
The  maximum  indentation  displacement  was  310nm.  For  each  indentation 
displacement,  nanoindentation  tests  were  conducted  at  four  different  locations  to 
ensure  repeatability  of  the  experimental  data.  The  actual  load-displacement  results 
are  shown  in  Fig,  3,  These  four  curves  arc  very  close  to  each  other  and  the  maximum 
deviation  in  load  at  the  maximum  displacement  is  0,07  mN.  Loading  involves 
nonlinearity  induced  by  the  nonlinear  material  behavior  and  increasing  contact  area 
(geometric  nonlinearity).  Unloading  is  purely  elastic.  However,  as  the  contact  area 
decreases  with  decrease  in  indentation  displacement,  geometric  effects  lead  to  a 
nonlinear  unloading  curve, 

5.  2 .  De  t  erm  ina  t  io  n  of  me  so  plus  t  k  par  am  efers 

The  finite  clement  computational  scheme  is  implemented  in  the  ABAQUS/Exphcit 
code  and  used  to  model  the  nanoindentation  problem.  As  stated  in  Section  2.1, 
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rig,  3.  Experimental  load-displacement  plots  at  maximum  indentation  depth  of  3l0nm. 


mcsoplastic  constitutive  equation  used  herein  depends  on  the  plastic  strain  rate.  In 
the  numerical  algorithm,  m  is  a  numerical  parameter  that  is  adjusted  to  satisfy  the 
Schmid's  law.  When  m  =  0,  Schmid's  law  is  satisfied  accurately.  However,  this 
condition  is  difficult  to  implement  in  computations  due  to  numerical  instability.  To 
resolve  this  problem,  m  should  be  slightly  larger  than  zero,  for  e.g,  0,02,  With  such  a 
small  value  of  mf  the  shear  stress  that  activates  a  slip  system  almost  corresponds  to 
the  yield  shear  stress.  The  condition  m  ^  1  implies  that  shear  stress  x{k)  is 
proportional  to  sliding  rate  y{kK  a  relation  identical  to  a  Newtonian  fluid.  Such  a 
situation  is  not  realistic  in  crystal  plasticity.  Additionally,  in  Eq.  (9),  the  initial  value 
of  the  current  strength  Tq  *  is  the  same  in  all  slip  systems,  and  is  given  as  a  user  input 
parameter  rm.  As  xm  decreases,  the  slip  system  becomes  easier  to  activate.  As  a  result, 
higher  plastic  deformation  incurs  at  a  given  indentation  force. 

We  turn  next  to  finding  the  appropriate  parameters,  Tin  and  the  hardening  moduli, 
a  and  /i  in  mesoplasticity  via  a  direct  comparison  between  the  simulation  results  and 
the  corresponding  experimental  data.  Through  multiple  trial  runs,  using  the 
parameters  given  in  Table  1,  a  reasonably  good  agreement  between  the  numerical 
and  experimental  load-displacement  curves  can  be  achieved  as  shown  in  Fig.  4.  It 
can  be  seen  that  the  numerical  model  has  captured  major  features  as  well  as  the 
magnitude  of  the  experimental  data.  The  resisting  force  on  the  indenter  increases  as 
the  indentation  depth  increases  and  drops  precipitously  upon  withdrawal  of  the 
indenter.  The  simulated  curve  agrees  well  with  the  experimental  curve  at  loading/ 
unloading. 

To  investigate  the  sensitivity  of  mesh  size  on  the  convergence  of  the  numerical 
solution,  the  mesh  size  with  the  smallest  clement  size  of  140  am  was  used  as  a 
benchmark.  Three  additional  mesh  cases,  namely,  10%  smaller,  10%  larger,  and 
50%  larger  were  tested  for  the  same  model  and  the  simulation  results  were 
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Table  l 

Material  parameters  of  single  crystal  copper 


Cm  (CPu) 

C,2  (GPa) 

C,  (GPa) 

Tin  (MPa) 

Yn 

*  (MPa) 

//(MPa) 

nr 

16B  4 

1214 

75.4 

34,8 

L0 

4.0 

0.6 

0,02 

Fig.  4.  Comparison  between  numerical  and  experimental  load-displacement  curves. 


compared.  For  the  cases  of  10%  larger  and  10%  smaller,  the  two  load  “displacement 
curves  exhibited  the  same  basic  features  as  the  benchmark  curve,  and  the  maximum 
deviation  among  these  curves  was  5.6%.  For  the  case  of  50%  larger  element  size, 
more  oscillations  with  amplitude  of  —  0.5  mN  were  observed  in  the  load-displace¬ 
ment  curve  and  the  maximum  deviation  from  the  benchmark  curve  was  9.3%. 

Additionally,  since  the  current  model  size  is  large  enough  with  respect  to  the 
nanoindentation  depth  of  310  nm}  only  the  dimensions  of  the  part  with 
mcsoplasticity  were  changed  to  test  the  numerical  results.  Two  different  cases, 
namely,  6  x  6  x  10  pm  and  12  x  12  x  10  pm  were  simulated  and  a  maximum  error  in 
the  load  ^displacement  curves  was  6.2%.  Based  on  the  sensitivity  analysis,  it  can  be 
concluded  that  the  simulation  results  have  converged  with  respect  to  the  mesh 
spacing  for  the  benchmark  mesh  size  and  the  model  size  used. 

Although  there  is  good  overall  agreement  between  the  experimental  and  the 
simulation  results,  the  loading  portion  of  the  simulation  curve  is  not  smooth,  despite 
various  attempts  to  obtain  a  smoother  curve.  While  the  dynamic  characteristics  of 
the  simulation  might  explain  partially  the  bumps  in  the  curve,  it  cannot  explain 
all,  as  most  portions  of  the  curve  arc  still  fairly  smooth*  It  may  be  noted  that 
similar  nonsmooth  loading  curves  in  indentation  were  also  reported  by  others 


129 


K  Liu  et  ctl  /  J.  Meek,  Phy\.  Solids  53  (2005)  2718-2741  2729 


Nomina*  Strain 


Fig-  5,  Stress-strain  curve  for  single  crystal  copper  at  meso-scale. 


(Kalidindi  ct  al.,  1992;  Smith  ct  a!.,  2001).  Smith  et  al.  (2001)  attributed  this  problem 
to  the  finite  size  of  the  elements.  Kalidindi  et  at.  (1992)  postulated  that  these  jumps 
can  be  minimized  by  using  a  finer  mesh  on  the  surface  of  the  sample.  The  jagged 
load-displacement  curve  presented  here  shows  a  trend  similar  to  what  others  have 
observed  at  the  meso-scale.  However,  further  investigation  is  needed  to  determine  if 
there  is  an  underlying  mechanism  for  the  jagged  load-displacement  curve,  or,  if  it  is 
only  due  to  computational  limitations. 

Various  parameters  obtained  using  the  combined  FEM/nanoindentalion  approach 
are  then  employed  to  determine  the  stress-strain  relationship  of  single  crystal 
copper  at  meso-scale.  Fig.  5  shows  the  stress-strain  curve  of  single  crystal  copper  at 
meso-scale, 

5.3.  Mechanical  behavior  of  single  crystal  copper  at  mesoscale 

The  nanoindentation  simulations  were  carried  out  along  the  [1*00]  direction  using 
a  spherical  indenter.  The  parameters  in  Table  1  were  used  for  the  mesoplastic 
constitutive  model.  Figs.  6(a)  and  (b)  show  the  distribution  of  out-of-plane 
displacement  profiles  at  two  stages  of  indentation.  They  show  the  deformation  on 
the  (100)  orientated  surface  with  a  four-fold  symmetry.  Flom  and  Komanduri 
(2002)  observed  the  same  deformation  patterns  in  indentation  experiments  and 
Wang  ct  al.  (2004)  also  reported  similar  results  using  conical  indenters  from  FEM 
simulations  and  experiments.  Both  positive  and  negative  values  of  displacement  can 
be  seen  from  Fig.  6(a).  The  positive  values  occur  symmetrically  around  the  indent 
impression  and  represent  the  pile-up  patterns  of  the  material.  With  increase  in 
indentation  depth,  the  pile-ups  become  more  and  more  pronounced  (see  Fig.  6(b)). 


2730 


Y.  Liu  er  ul  /  J  Meek  Phy j.  SMs  53  (2005)  2? i 8-274 1 


1001 1 


<a> 


(6) 


Fig,  6.  Distributions  of  out-of-plane  dispheements  at  two  stages  of  indentation:  (a)  lOOnm;  (b)  3IOnm. 
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Fig.  7.  Comparison  of  FEM  and  AI  M  images  of  the  out-of-plane  displacement  at  an  indentation  depth  or 
3 10  run:  {a)  Numerical  results;  (b)  AFM  image. 


Figs.  7(a)  and  (b)  show  a  comparison  of  the  numerical  out-of-plane  displacement 
plot  and  the  AFM  image  after  unloading  the  indenter.  Even  though,  it  is  somewhat 
difficult  to  precisely  capture  in  the  simulations  all  phenomena  observed  in 
experiments,  both  figures  show'  some  similar  features,  for  example,  a  four-fold 
symmetry.  Since  self-hardening  and  latent-hardening  for  each  slip  system  have 
already  been  taken  imo  account  in  the  hardening  law,  the  effect  of  other  active  slip 
systems  on  the  dislocation  activities  in  a  lattice  have  also  been  included  in  the 
simulation.  It  is  also  possible  that  the  radius  of  the  styli  may  not  be  exactly  spherical, 
especially  with  the  smaller  diameter  ones. 


131 


Y.  Liu  et  at.  /  J  Meek  Phys.  Solids  53  (2005)  2718-2741  2731 


Fig.  8.  Shear  stress  ex:  distributions  in  the  (100)  plane  al  two  stages  of  indentation:  (a)  lOOnm;  (b) 
310nrn. 


Figs.  8(a)  and  (b)  show  the  shear  stress  <t,*j  distribution  at  two  stages  of 
indentation.  Using  a  spherical  indentcr  in  nanoindentation,  the  stress  waves 
propagate  towards  the  four  vertices  in  the  (100)  plane  and  the  stress  magnitudes 
increase  with  the  indentation  displacement.  Figs.  9(a)  and  (b)  show  the  distribution 
of  normal  stress  oyv  (y-axis  coincides  with  the  [100]  direction)  in  the  mid-section  (the 
normal  direction  is  along  [00  l])  of  the  single  crystal  at  two  stages  of  indentation. 
The  stress  distribution  is  symmetric  with  respect  to  the  [100]  direction.  These  figures 
also  indicate  that  maximum  compressive  stress  occurs  right  underneath  the  indentcr 
with  its  magnitude  increasing  with  indentation  depth,  and  the  domain  of  the 
compressive  stress  increases  with  the  indentation  displacement.  Tensile  stress  zones 
become  larger  and  larger  with  increasing  indentation  displacement,  causing  more 
significant  pile-ups. 

Fig.  10  shows  a  zoomed-in  view  of  the  pile-up  on  a  displacement  plot  in  the  same 
section  as  that  in  Fig.9,  after  unloading  the  indenter.  Figs.  1 1(a)  and  (b)  show  the 
distributions  of  the  shear  strain  in  the  mid-section  at  two  stages  of  indendation.  The 
total  shear  strain  is  calculated  using  the  following  equation: 

y  =  £lV[>|.  (15) 

it 

The  shear  strain  is  related  to  the  status  of  each  slip  system  by  the  Schmid's  law,  and 
is  associated  with  the  dislocation  density  as  well.  As  Figs,  1 1(a)  and  (b)  show,  the 
shear  strain  distribution  is  symmetric  with  respect  to  the  [10  0]  direction. 

5  4  Effect  of  friction  on  nanoindentation  behavior 

We  now  compare  the  pile-up  profiles  of  the  indent  between  the  numerical  and 
experimental  data.  The  dashed  line  in  Fig.  12  shows  the  surface  profile  scanned 
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(a) 

Fig.  9  (jyi  distributions  in  the  <00  J)  section  <it  two  stages  of  indentation:  (a)  100 nm;  (b)  3l0nm. 
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Fig.  10.  Pile-up  of  the  indentation  region  in  the  (00  l)  section  after  unloading 


(a)  {hi 

Fig.  1 1  Distributions  of  shear  strain  (total  shear  strain  on  all  the  slip  systems)  in  the  <00  1)  section  at  two 
stages  of  indentation:  (a)  tOOnm;  (h)  3lf)nm. 


through  the  center  of  the  indentation  impression  along  a  line  in  the  [0  l  1}  direction 
(as  determined  from  the  AFM  image  in  Fig.  7).  Also  shown  is  the  surface  profile 
determined  from  the  numerical  analysis  for  a  frictionless  contact  condition  (i.c.* 
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Fig.  12.  Comparisons  of  the  pile-up  profiles  for  different  COF 


coefficient  of  friction,  COF  =  0)  with  the  highest  material  pile-up.  It  can  be  seen  that 
the  depth  of  indentation  is  reasonably  in  good  agreement  with  the  depth  as 
determined  from  the  AFM  image  (~285nm).  The  material  pile-up  from  the 
numerical  analysis  for  a  COF  of  zero  is  125  nm,  while  the  value  from  the  experiment 
is  60  nm.  Even  though,  the  material  pile-up  is  within  the  same  order  of  magnitudes 
for  both  cases,  the  values  differ  by  a  factor  of  about  two.  This  difference  can  be 
attributed  partially  to  friction. 

The  analysis  conducted  thus  far  has  focused  on  a  frictionless  contact  between  the 
nanoindenter  and  workpiece.  It  may,  however,  be  noted  that  friction  does  play  a  role 
at  the  contact  surface  and  can  affect  the  surface  profile.  To  investigate  this,  we 
introduced  Coulomb  frictional  contact  in  the  FEM  simulation  of  nanoindentation. 
Because  the  COF  is  unknown,  we  used  several  values  of  COFs  between  p  =  0  and  0.8 
in  the  simulations,  attempting  to  find  a  COF  that  would  give  us  the  same  surface 
profile  as  measured  by  the  AFM.  A  constant  COF  is  used  in  each  simulation.  Fig.  12 
shows  the  simulated  surface  profile  in  the  [0111  direction  scanned  through  the  center 
of  the  impression  for  COFs  between  0  and  0.4.  The  results  indicate  that  pile-up 
decreases  as  COF  increases.  We  also  find  that  as  the  COF  reaches  0.4,  further 
increase  in  the  COF  does  not  cause  any  additional  reduction  in  the  material  pile-up. 
This  is  most  likely  because  nonslip  contact  condition  has  been  reached  at  a  COF 
equals  to  or  near  0.4,  so  that  increasing  COF  simply  maintains  the  same  nonslip 
contact  condition  and  does  not  contribute  to  further  reduction  in  the  material  pile- 
up.  At  COF  of  0.4,  the  simulated  material  pile-up  is  75  nm,  close  to  60  nm  which  is 
the  material  pile-up  measured  from  AFM.  To  the  best  of  knowledge  of  the  authors, 
this  difference  is  small  among  the  published  literature  at  this  scale  lending  further 
confirmation  on  the  appropriateness  of  the  mesoplastic  parameters  determined  from 
the  combined  FEM/nanoindcntation  approach. 
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We  next  address  the  depth  of  impression  as  shown  in  Fig.  12.  The  indentation 
depth  is  constant  and  independent  of  the  COF.  Because  the  indentation  depth  is  not 
affected  by  COF,  the  load-displacement  curve  should  not  be  affected  by  the  COF  as 
well.  It  is  seen  that  the  load-displacement  curves  are  almost  identical  within  the 
range  of  limited  numerical  noise  (see  Fig.  13  for  COFs  between  0  and  0.4).  As  a 
result,  this  combined  FEM/nanoindentation  approach  directed  towards  finding  the 
mcsoscale  parameters  in  single  crystal  copper  through  correlating  numerical 
load-displacement  curve  with  the  nanoindentation  counterpart  is  not  influenced 
by  the  COF  at  the  contact  surface, 

5.5.  Effect  of  spherical  indent er  radius  on  the  measured  mechanical  properties 

Material  parameters  determined  from  nanoindentation  tests  can  be  size- 
dependent,  namely,  indentation  depth  and  indenier  radius.  It  may  be  noted  that 
theoretical  framework  for  mesop  fas  deity  used  in  this  investigation  does  not 
encompass  the  size  effect.  Ideally,  it  can  be  applied  to  any  scale  of  indentation 
provided  the  material  has  the  same  crystalline  characteristics.  In  reality,  however,  the 
nanoindentation  size  effect  may  result  in  changes  in  the  yield  stress  and  the 
hardening  law. 

Material  hardness,  as  one  of  the  important  mechanical  properties,  can  be 
measured  using  the  nanoindentation  technique  with  spherical  indenters  of  different 
radii.  To  address  this  problem,  numerical  simulations  were  conducted  using 
spherical  indenters  with  different  radii  (3.4,  7,  10,  25,  50,  100,  200,  500  and 
1000  pm)  at  corresponding  indentation  depths  (310,  638,  912,  2280,  4560,  9120, 
18,240,  45,600  and  91,200  nm).  FEM  results  indicate  that  pile-up  on  the  indent 
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Fig.  14.  Results  or  simulation  showing  the  variation  or  mean  pressure.  Pm\  (a)  with  indent  radius,  a  for 
different  indenter  radii,  R\  (b)  with  a/R  for  different  indenter  radii. 


surface  using  a  spherical  indenter  with  a  smaller  radius  is  more  evident  than  with  a 
larger  radius.  This  is  because  for  a  given  depth  of  indentation,  smaller  radii  indenters 
are  less  blunt  than  larger  radii  indenters. 

Fig.  14(a)  shows  the  variation  of  mean  pressure,  Pm  with  indent  radius,  a  for 
different  indenter  radii,  fi  on  a  semi-log  scale.  It  shows  that  normal  pressure 
increases  with  increase  in  the  indent  radius.  It  also  shows  that  this  increase' is  higher 
for  spherical  indenters  of  smaller  radii  than  with  larger  radii,  especially  for  lower 
values  of  the  indent  radii.  Fig.  14(b)  shows  the  variation  of  mean  pressure,  Pm  with 
the  ratio  of  the  indent  radius  to  the  indenter  radius,  (a/R)  for  different  indenter  radii, 
R.  It  is  seen  that  a  single  curve  can  be  fitted  for  Pm  versus  (a/R)  for  different  indenter 
radii,  when  the  indenter  depth  is  normalized  with  respect  to  the  indenter  radius.  The 
best  fit  equation  is  Pm  =  0.9583 (a/R)omi.  The  small  differences  in  this  case  are 
attributed  to  the  work-hardening  behavior  of  the  sample  as  well  as  minor  numerical 
instabilities  due  to  mesh  distortions.  Lim  and  Chaudhri  (1999)  found  significant 
differences  in  the  variation  of  mean  normal  pressure  Pm  with  (a/R)  between  the 
work-hardened  and  annealed  OFC  samples.  For  the  work-hardened  sample,  the  Pm 
value  for  a  given  (a/R)  is  only  slightly  greater  for  the  indenter  of  7  pm  radius  than  for 
the  other  two  indenters  (200  and  500  pm)  for  which  it  is  the  same.  The  simulation 
results  in  this  paper  show  a  similar  trend.  To  confirm  the  work-hardened  behavior  of 
the  work  material,  the  indents  were  scanned  under  AFM.  In  general,  scanning  along 
a  line  in  a  typical  AFM  image  of  an  indent  impression  shows  pile-up  in  a  work- 
hardened  material  and  sink-in  in  the  case  of  an  annealed  specimen  (McElhancy  et  al., 
1998;  Lim  and  Chaudhri,  1999),  In  the  case  of  single  crystal  copper  in  our  case,  we 
scanned  the  surface  along  a  line  on  an  indent,  and  plotted  the  height  profile  in 
Fig.  12.  A  pile-up  is  observed  around  2.5  pm.  Additionally,  a  pile-up  is  observed 
around  4  pm  along  a  line  on  an  indent  formed  by  impression  of  a  Berkovich  indenter 
(Fig.  15).  The  pilc-ups  in  indents  formed  by  both  spherical  and  Berkovich  indenter 
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Fig.  15.  AFM  (mage  of  an  indent  impression  made  with  a  Berkovich  indenter. 


Fig.  16.  (a)  Results  or  FEM  simulations  showing  the  variation  of  the  indentation  force,  F  with  indent 
diameter,  2 a  for  different  indenter  radii,  R,  <h)  Results  or  indentation  experiments  showing  the  variation  of 
the  indentation  force,  F  with  indent  diameter.  2a  for  different  indenter  radii,  R. 

impressions  indicate  that  the  single  crystal  copper  is  close  to  work-hardened 
condition. 

In  Figs,  14(a)  and  (b)t  the  variable,  indent  radius,  is  involved  in  both  axes.  To 
eliminate  this  effect,  the  variation  of  normal  force  with  indent  diameter  is  plotted  on 
a  log-log  scale  (Fig.  16(a)).  It  shows  that  most  of  the  data  points  fall  on  a  single  line 
and  independent  of  the  indenter  radius  used  (in  this  case  from  3.4  to  1000  pm).  A 
straight  line  can  be  fitted  between  the  indenter  force  and  the  indent  radius.  The 
equation  for  the  best  fit  is  given  by  F  =  0.4 101  (2a)20359,  where  F  is  the  indenter  force 
and  2 a  is  the  indent  diameter,  it  is  interesting  to  note  that  this  equation  is  very 
similar  to  the  equation  normally  used  at  the  macrolcvel  for  spherical  indenters, 
namely,  P  —  A(2aft  where  the  exponent,  n  is  called  the  Meyer  index  (1908).  The 
value  of  n  is  reported  to  vary  from  2  for  fully  strain-hardened  metals  to  2.5  for 
annealed  metals  (Shaw  and  DcSalvo,  1972),  The  value  of  the  exponent,  n  obtained  in 
the  present  investigation  is  2.0359,  indicating  the  work-hardening  behavior  of  the 
single  crystal  copper  specimen.  Fig,  16(a)  shows  very  little  dependence  of  the 
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indenter  radius  on  the  norma)  force  at  the  microscale,  similar  to  the  ones  observed  at 
the  macroscale.  This  equation  was  shown  to  hold  for  indenters  of  diameters  in  the 
range  of  1-30  mm  by  Krupkowski  (1931). 

In  order  to  compare  the  mean  normal  pressure  values  obtained  from  simulations, 
nanoindentation  and  microindentation  experiments  were  conducted  for  several  available 
spherical  indenters  (3.4,  10  and  500  pm),  The  nanoindentation  tests  of  spherical  diamond 
indenters  (radii  3,4  and  10  pm)  were  performed  on  the  MTS  Nano  Indenter  XP  system, 
whereas  the  micro  indentation  tests  of  spherical  tungsten  carbide  indenter  (radius 
500  pm)  were  conducted  on  a  Clark  Micro  Hardness  Testers  (Model  CM-700AT). 
The  experimental  data  are  presented  in  Fig.  16(b).  The  best  fit  equation  is  F  - 
0, 3867(2 a)2'0352.  The  exponent  as  obtained  from  the  experiment  2,0352  is  very  close  to  the 
simulation,  which  is  2.0359.  From  these  results,  it  can  be  seen  that  the  approach  of  using 
combined  nanoindentation/FEM  simulation  for  determining  mesoplastic  model  para* 
meters  works  reasonably  well  from  the  microlevel  to  the  macrolevel. 


6.  Discussion 

The  mesoplastic  constitutive  law  used  in  this  paper  is  a  mechanism-based  plasticity 
model  derived  from  rate-dependent  crystallographic  theory.  It  is  capable  of 
capturing  the  effects  of  crystal  orientations.  The  approach  of  combining  numerical 
simulation  with  nanoindentation  experiments  allowed  the  determination  of  the 
stress-strain  relation  at  mesoscale.  Numerical  simulations  of  nanoindentation  using 
appropriate  parameters  for  material  behavior  at  these  scales  have  a  reasonably  good 
agreement  with  the  nanoindentation  results  on  indentation  mean  pressure  and 
surface  pilcup.  It  is  noted,  however,  that  the  material  length  scale  is  not  explicitly 
included  in  the  current  mesoplastic  constitutive  law.  Therefore,  it  is  applicable  only 
to  problems  where  size  effects  do  not  dominate  the  material  behavior;  it  is  currently 
not  capable  of  modeling  the  size  effects  such  as  the  indenter  size-dependent  mean 
pressure  as  reported  for  annealed  polycry stalline  copper  by  Lim  and  Chaudhri 
(1999),  Within  the  framework  of  numerical  simulations,  the  length  scale  issues  can  be 
potentially  incorporated.  At  the  continuum  level,  there  are  usually  three  formula¬ 
tions  to  take  into  consideration  the  size  effects  (Ncedleman,  2000):  discrete 
dislocation  plasticity,  nonlocal  plasticity  and  the  coupling  of  matter  diffusion  and 
deformation.  In  discrete  dislocation  plasticity  (Amodeo  and  Ghonicm,  1990; 
Gulluoglu  and  Hartley,  1993;  van  der  Giessen  and  Necdlcman,  1995),  the 
dislocations  are  modeled  as  line  singularities  in  an  elastic  medium.  The  plastic  flow 
is  represented  by  the  collective  motion  of  a  number  of  discrete  dislocations. 
Dislocation-based  plasticity  has  a  characteristic  length,  the  Burgurs  vector,  and  is 
capable  of  capturing  the  size-dependent  phenomena.  In  the  nonlocal  plasticity 
theory  (Fleck  et  ah,  1994;  Nix  and  Gao,  1998;  Shu  and  Fleck,  1998),  the  stress  tensor 
or  higher  order  stress  tensor  depends  on  both  strains  and  strain  gradients.  Different 
formulations  for  nonlocal  plasticity  have  been  presented  (see  Needlcman,  2000  for  a 
review).  However,  there  is  no  unified  formulation  in  nonlocal  plasticity  applicable  to 
both  single  crystals  and  polycrystalline  materials  at  all  length  scales.  Whether  a 


138 


2738  Y.  Liu  et  at  /  /  Meek  Pf\yx  Solids  53  {2005}  2718-2741 

nonlocal  plasticity  theory  is  suitable  or  not  depends  ultimately  on  comparison  with 
experimental  data.  Consideration  of  the  size-dependence  through  the  coupling  of 
matter  diffusion  and  deformation  is  usually  important  at  high  temperatures,  such  as 
the  temperature  encountered  in  the  manufacturing  process  of  semiconductor  devices. 
In  this  case,  surface  diffusion  drives  the  formation  of  islands  due  to  the  deformation 
of  thin  films.  Zhang  and  Bower  (1999)  found  that  islands  will  form  if  the  initial 
roughness  on  the  surface  of  the  film  exceeds  certain  critical  wavelength. 

The  work  presented  herein  indicates  that  there  is  no  size-dependence  in 
stress-strain  relation,  as  well  as  hardness  on  work-hardened  single  crystal  copper. 
In  situations  where  size-effects  are  present,  as  in  the  case  of  annealed  copper,  size 
effects  need  to  be  introduced  in  the  formulation  of  numerical  simulation.  The 
approach  used  in  this  paper  will  be  expanded  in  a  future  work  to  incorporate  size- 
dependence  in  the  formulation. 


7,  Conclusions 

1.  A  multiscale  finite  element  model  involving  meso-plasticity  and  elasticity  was 
used  for  nanoindentation  of  single  crystal  copper.  A  combined  FEM  simulation  and 
nanoindentation  experimental  approach  was  used  to  determine  the  parameters  for 
the  mesoplastic  constitutive  model.  Nanoindentation  tests  were  conducted  to  obtain 
the  load-displacement  characteristics.  The  parameters  for  mesoplastic  constitutive 
model  were  determined  by  fitting  the  numerical  load  displacement  curves  to  the 
experimental  data.  Numerical  results  indicate  that  hardening  parameters  a  and  fi  and 
initial  critical  shear  stress,  tin  have  a  strong  effect  on  the  nanoindentation 
load -displacement  relationship.  A  reasonably  good  agreement  between  numerical 
and  experimental  results  on  the  load -displacement  curve  was  obtained. 

2.  Mesa-mechanical  behavior  of  single  crystal  copper  was  investigated.  The 
distribution  of  displacements,  stresses,  shear  strains,  and  shear  stresses  were  analyzed 
at  various  stages  of  indentation.  Numerical  results  show  the  deformation  on  the 
{1  0  0)  orientated  surface  in  nanoindentation  has  a  four-fold  symmetry  similar  to  the 
AFM  image  of  nanoindentation.  This  is  consistent  with  the  results  reported  by  Flom 
and  Komandun  (2002)  and  Wang  cl  ai,  (2004). 

3.  With  increase  in  indentation  depth,  material  pile-up  around  the  indenter  also 
increases.  The  pile-up  was  more  pronounced  with  smaller  indenter  radius.  FEM 
simulation  results  also  indicate  that  pile-up  decreases  as  the  coefficient  of  friction 
(COF)  increases  from  0  to  0,4,  while  the  nanoindentation  load- displacement 
relationship  is  found  to  be  independent  of  the  coefficient  of  friction.  Simulated 
material  pile-up  at  COF  of  0.4  compared  reasonably  well  with  the  measured  piic-up. 
The  maximum  shear  strain  associated  with  the  dislocation  density  occurs  underneath 
the  top  surface  which  can  lead  to  macrocrack  formation.  Various  parameters 
determined  from  the  combined  FEM/nanoindentution  are  then  used  to  determine 
uniaxial  stress-strain  relationships  at  microscale. 

4.  The  variation  of  hardness,  Pm  with  a/R  shows  an  increasing  trend  in  hardness 
with  increase  in  (a/R)  and  independent  of  indent  diameter.  The  equation  for  the  best 
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fit  is  given  by  Pm  =  0.9583(a/R)OWS4.  It  is  interesting  to  note  that  this  equation  is 
very  similar  to  the  equation  at  the  macrolevel  Tor  spherical  indenters,  namely, 
P  =  A(2a)n,  where  the  exponent,  «  is  the  Meyer  index  (1908).  This  equation  was 
shown  to  hold  for  indenters  of  diameters  in  the  range  of  1-30  mm  by  Krupkowski 
(1931).  It  is  shown  in  the  present  investigation  that  this  effect  can  be  attributed  to  the 
work-hardening  behavior  of  the  material  in  simulations  using  spherical  indenters  of 
radii  varying  from  3.4  to  1000 pm.  Similar  results  were  reported  by  Urn  et  al.  (1998) 
and  Lim  and  Chaudhri  (1999)  for  polycrystalline  work-hardened  copper  for 
indenters  of  radii  from  7  to  500  pm. 

5.  The  variation  of  normal  force  was  found  to  increase  linearly  with  indent 
diameter,  on  a  log-log  plot,  and  independent  of  the  indenter  radius.  The  equation  for 
the  best  fit  is  given  by  F  =  0.4l0l(2a)10359,  where  F  is  the  normal  force  and  2a  is  the 
indent  diameter.  The  value  of  n  is  2.0359.  The  range  of  values  of  n  reported  to  vary 
from  2  for  fully  strain-hardened  metals  to  2.5  for  annealed  metals  (Shaw  and 
DcSalvo,  1972).  The  value  of  the  exponent  n  obtained  in  the  present  investigation  is 
2.0359,  indicating  the  work-hardening  behavior  of  the  single  crystal  copper 
specimen.  The  exponent  as  determined  from  experiments  is  2.0352  which  is  also 
very  close  to  the  simulation  results. 

6.  The  results  of  the  present  investigation  (Figs.  16(a)  and  (b))  indicate  that  use  of 
combined  nanoindentation/FEM  simulation  approach  for  determining  mesoplasitc 
model  parameters  works  reasonably  well  from  microlevel  to  the  macrolevel. 

7.  The  method  presented  in  this  study  has  provided  an  approach  to  determine  the 
microstructura!  parameters  through  nanoindentation  tests  and  numerical  simula¬ 
tions  that  can  be  used  for  quantitative  analysis  for  a  wide  range  of  problems 
involving  mesoplastic  behavior. 
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Abstract 

Nanometric  cutting  of  single-crystal  materials  at  conventional  cutting  speeds 
(5 ms"1)  is  simulated  for  the  first  time  using  a  new  Monte  Carlo  method  that 
is  applicable  to  systems  that  are  neither  canonical  nor  microcanomcal.  This 
is  accomplished  by  defining  a  local  temperature  in  the  cutting  zone  using 
the  thermal  analysis  developed  by  Komanduri  and  Hou  for  conventional 
machining.  Extension  of  this  method  to  the  nanometric  regime  permits  an 
accurate  estimate  of  the  local  temperature  in  cutting.  This  temperature  is  then 
employed  in  the  Boltzmann  probability  distribution  function  that  is  used  to 
determine  the  acceptance-rejection  of  Monte  Carlo  moves  in  the  simulation. 
Since  cutting  speed  is  closely  related  to  cutting  temperature,  the  cutting  speed 
enters  the  calculation  via  the  thermal  analysis  equations.  The  method  is  applied  to 
nanometric  cutting  of  single-crystal  aluminium  with  the  crystal  oriented  in  the 
(001)  plane  and  cut  in  the  [100]  direction.  Three  positive  rake  cutting  tools, 
namely  10°,  30°  and  4 5°,  are  employed  to  investigate  the  effect  of  the  rake 
angle  on  the  forces,  the  specific  energy  and  the  nature  of  the  chip  formation. 
The  method  is  evaluated  by  direct  comparison  with  corresponding  molecular 
dynamics  simulations  conducted  under  the  same  conditions. 


Nomenclature 
a  thermal  diffusivity  (Jem2) 
a/r  acceptance- to-  reject  ion  ratio 
A  area 

AB  length  of  shear  plane  heat  source  (A) 
i?dnp  fraction  of  the  heat  conducted  into  the  chip 
A B  maxim um  compensation  for  heat  partition 
I)  equilibrium  dissociation  energy  (eV) 

F  force  (nN) 

Fc  cutting  force  (nN) 

Fft  frictional  force  (nN) 

Fs  shear  force  (nN) 

Ft  thrust  force  (nN) 
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J  joule's  mechanical  equivalent  of  heat 
K  Boltzmann  constant 
K0(u)  Bessel  function 

L  length  of  tool-chip  interface  frictional  heat  source  (A) 
ma  mass  of  an  atom 
Pec  let  number 

AAfooi  number  of  atoms  in  the  tool 
iVwp  number  of  atoms  in  the  workpiece 
P{q o)  Boltzmann  distribution  function 

f/pt  heat  liberation  intensity  of  a  moving-line  heat  source  (Jem-  s-  ) 
qpii  heat  liberation  intensity  of  a  moving-induced  plane  heat  source 
(J  cm-1s-1) 

heat  liberation  intensity  of  a  moving-plane  heat  source  (Jcm“ls-1) 
f/o  configuration  of  N  particles  in  three-dimensional  space 
r  interparticle  distance  (A) 
rc  equilibrium  distance  (A) 

R  distance  between  the  moving-line  heat  source  and  the  point  M  (A) 

/  depth  of  cut  (A) 

/ch  chip  thickness  (A) 

T  Temperature  (/Q 

v  velocity  of  moving  plane  heat  source  (ms-1) 
vc  cutting  velocity  (mf1) 
chip  velocity  (ms-1) 

\\  velocity  of  the  fth  atom  (m s-1) 

V  Morse  potential  (eV) 

Tmin  minimum  potential  (eV) 

AV  change  in  Morse  potential  (eV) 
w  width  of  cut  (A) 

X%  z  coordinates  of  the  point  at  which  the  temperature  rise  occurs  in  the 
moving  coordinate  system  (m) 

Greek  letters 

a  rake  angle  (deg) 

$  temperature  rise  (°C) 

temperature  rise  at  any  point  ( °C ) 
k  thermal  conductivity  (Jem-1  s-loC-1) 
fjjsr  uniform  random  number  between  0  and  1  of  the  ith  Markov  move  on  the 
Nth  atom 

p  density  (gem-3) 

0  shear  angle  (deg) 

(p  oblique  angle  (deg) 


§L  Introduction 

Molecular  dynamics  (MD)  simulation  is  a  powerful  technique  for  modelling  the 
cutting  process  on  a  nanoscale  (Komanduri  and  Raff  2001)*  However,  there  are  three 
significant  limitations  associated  with  this  process,  namely:  (i)  the  availability  of 
accurate  potential  energy  surfaces:  (ii)  the  need  to  use  high  cutting  speeds  (100 
500ms-1)  and  (iii)  the  consideration  of  a  fewer  number  of  workpiece  atoms  (a 
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few  thousand).  The  latter  two  are  used  to  limit  the  computational  time  to  a  reason¬ 
able  value.  These  limitations  have  been  of  serious  concern  to  most  researchers  work¬ 
ing  in  this  field  for  they  would  like  to  consider  larger  workpieces  (a  few  million  atoms 
or  more)  and  conventional  cutting  speeds  (2-5  ms-1)  to  simulate  nanometric  cutting. 
Currently,  this  is  not  possible  to  accomplish  with  MD  simulation  even  using  the 
fastest  state-of-the-art  computers,  including  massive  parallel  processors. 

One  way  to  overcome  the  latter  two  of  the  above  limitations  is  to  use  a  Monte 
Carlo  (MC)  method,  since  time  or  cutting  speed  does  not  enter  directly  into  this 
method  (Komanduri  et  al.  2002).  Also,  some  of  the  detailed  time-consuming  calcula¬ 
tions  conducted  in  MD  simulations  are  of  little  practical  interest.  For  example, 
detailed  information  concerning  the  precise  movement  of  the  atoms  at  all  instants 
of  time  before  reaching  their  equilibrium  positions  may  not  be  of  interest.  If  we 
employ  a  method  that  does  not  address  such  issues  of  no  significance,  valuable 
computational  time,  as  well  as  memory  space,  can  be  saved. 

In  this  investigation,  nanometric  cutting  is  simulated  at  conventional  cutting 
speeds  (5ms_l)  using  a  new  MC  method  (Narulkar  2002)  that  is  applicable  to 
systems  that  are  neither  canonical  nor  microcanonical.  Even  though  cutting  speed 
docs  not  directly  enter  into  the  MC  calculations,  its  effect  can  be  introduced 
indirectly  via  the  temperature  generated  in  the  cutting  zone,  since  cutting  speed  is 
related  to  cutting  temperature.  The  thermal  analysis  method  developed  by 
Komanduri  and  Hou  (2000,  2001a,  b)  for  conventional  cutting  is  extended  to  the 
nanometric  regime  to  permit  local  temperatures  in  the  cutting  region  to  be  estimated. 
This  temperature  is  then  employed  in  the  Boltzmann  probability  distribution  func¬ 
tion  (PDF)  that  is  used  to  determine  the  acceptance-rejection  of  MC  moves  in  the 
simulation.  The  method  allows  MC  simulations  of  nanometric  cutting  to  be  con¬ 
ducted  at  conventional  cutting  speeds  (5ms-1)  without  burdening  the  system  with 
significant  computational  time  as  in  MD  simulations,  MC  simulations  of  nanometric 
cutting  were  conducted  at  a  conventional  cutting  speed  (5  ms-1)  with  three  positive 
rake  angle  tools,  namely  10°,  30°  and  45°  to  investigate  the  effect  of  the  rake  angle  on 
the  forces,  the  specific  energy  and  the  nature  of  the  chip  formation  process. 

52.  Monte  Carlo  simulation  of  nanometric  cutting 

MC  methods  have  been  applied  extensively  in  such  diverse  fields  as  engineering, 
materials  science,  mathematics,  medicine,  chemistry  and  physics  (Rubinstein  1981). 
The  basis  for  MC  methods  is  that  the  configuration  of  the  physical  system  is  describ- 
able  by  a  probability  density  function  (Metropolis  and  Ulam  1949,  von  Neumann 
1951).  MC  methods  have  proven  to  be  an  effective  way  of  reducing  the  total  com¬ 
putational  time.  The  actual  time  reduction  depends  upon  many  factors,  including  the 
cutting  speed,  the  number  of  atoms  in  the  workpiece  Nwp ,  and  the  type  of  potential 
used. 

At  high  cutting  speeds  of  about  500  ms-1,  MD  will  generally  be  the  method  of 
choice,  for  there  is  not  much  gained  by  using  MC  simulation.  As  the  cutting  speed 
decreases,  MC  methods  will  quickly  become  significantly  more  efficient.  Figures  1  (a) 
and  (b)  show  the  variation  in  the  computational  time  with  the  number  of  atoms  and 
cutting  velocity  vf  respectively  for  MD  and  MC  simulations  in  uniaxial  tension  and 
nanometric  cutting,  respectively.  When  the  number  of  atoms  considered  is  very 
small,  the  difference  in  computational  time  between  the  two  methods  is  small 
(Figure  1  (a)).  For  example,  Komanduri  et  al.  (2002)  have  shown  that  a  combined 
MC -damped  trajectory  (DT)  method  for  nanometric  uniaxial  tensile  testing  of  fee 
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Figure  1.  Variation  of  computational  time  with  (a)  the  number  of  atoms  in  uniaxial  tensile 
testing  in  MD  and  MC  simulations  and  (b)  the  cutting  velocity  in  MD  and  MC 
simulations. 


metals  takes  about  16%  of  the  time  required  for  MD  simulation  for  a  system  of 
2500  atoms.  This  difference  increases  significantly  as  the  number  of  atoms  considered 
increases,  for  example  to  10 000  atoms  or  higher.  This  trend  would  be  the  same  in  the 
case  of  nanometric  cutting  as  well.  This  difference  will  be  even  more  pronounced  in 
nanometric  cutting  as  the  cutting  speed  decreases  from  about  500  ms”1  to  practical 
cutting  speed  values  (about  2-5  m  s-1)  as  shown  in  figure  I  (b).  This  is  because,  while 
there  is  not  much  difference  between  the  computational  times  in  the  MC  and  MD 
simulations  at  a  high  cutting  speed  (500  ms-1),  with  decrease  in  cutting  speed  (about 
2  5ms”1),  MD  simulation  times  increase  proportionately  with  decrease  in  cutting 
speed  while  the  time  required  for  MC  simulation  is  nearly  independent  of  cutting 
speed.  Further,  MC  computations  will  generally  become  even  more  favourable  as  the 
complexity  of  the  potential  increases,  for  example  from  a  simple  pairwise  empirical 
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potential  to  a  complex  potential,  such  as  the  embedded-atom  potential,  This  is 
because  MD  calculations  require  the  determination  of  the  derivatives  of  the  potential 
whereas  MC  calculations  require  only  the  potential  itself. 

Most  MD  simulations  of  nanometric  cutting  are  conducted  at  very  high  cutting 
speeds  ( 1 00-500 ms^1)  to  reduce  the  computational  time  (Belak  and  Stowers  1990, 
Komanduri  and  RalT200l}.  The  major  part  of  the  computational  time  is  associated 
with  the  numerical  integration  of  the  classical  Newtonian  equations  of  motion  for 
the  interacting  particles  to  obtain  new  configurations  in  space  over  a  certain  period 
of  time.  If  some  or  these  calculations  can  be  eliminated,  a  significant  saving  in  the 
computational  time  should  result. 

In  MC  methods,  the  computational  time  can  be  reduced  because  the  time  vari¬ 
able  is  not  directly  involved.  The  temporal  dependence  of  the  positions  of  the  atoms 
is  not  computed  using  Newtonian  equations;  instead,  random  Markov  moves  are 
used  to  obtain  a  series  of  equilibrium  configurations.  Figure  2  is  a  schematic  diagram 
of  the  MC  simulation  of  nanomctric  cutting  (Narulkar  2003).  The  workpiece  atoms 
are  divided  into  moving  atoms  and  boundary  atoms.  The  purpose  of  boundary 
atoms  is  to  simulate  the  bulk  and  hence  these  atoms  are  not  allowed  to  move  during 
the  simulation.  Consequently,  interactions  between  boundary  atoms  are  not  calcu¬ 
lated,  All  tool  atoms  are  designated  as  boundary  atoms;  that  is,  the  tool  is  considered 
to  be  infinitely  hard.  Also,  the  boundary  atoms  of  the  tool  do  not  interact  with  the 
boundary  atoms  of  the  workpiece. 

Unlike  in  MD  simulations,  it  is  not  necessary  to  incorporate  a  peripheral  zone 
in  MC  simulations.  The  purpose  of  the  peripheral  atoms  in  MD  simulations  is  to 
dissipate  the  heat  generated  during  the  cutting  process  much  like  the  heat  carried 
away  by  the  chip  and  the  lubricant  in  addition  to  the  cutting  tool  and  the  work 
material.  This  way,  the  effect  upon  the  energy  transfer  within  the  solid  that  would  be 
present  for  an  extended  lattice  model  is  included  in  the  calculations.  In  MC  simula¬ 
tions,  momentum  and  heat  dissipation  terms  are  not  directly  involved.  Hence,  there 
is  no  need  for  peripheral  atoms  in  the  simulation. 

Since  nature  always  drives  any  system  towards  its  most  stable  configuration 
or  minimum  potential  Kmin,  in  MC  simulations,  we  randomly  select  configurations 
that  move  the  system  towards  its  minimum  potential.  In  most  cases,  the  PDF  is 
unknown  so  that  this  selection  cannot  be  accomplished  in  a  straightforward  fashion. 
A  common  method  to  handle  this  type  of  problem  involves  the  generation  of 


•  Boundary  atoms  O  Moving  atoms 

Figure  2,  Schematic  diagram  of  MC  simulation  of  nanometric  cutting  showing  various 
regions  of  interest  (Narulkar  2003). 
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random  configurations  using  Markov  chains  generated  by  the  Metropolis  sampling 
procedure  (Metropolis  and  Ulam  1949,  von  Neumann  1951), 

Consider  an  initial  configuration  %  for  an  N-particle  system.  The  Boltzmann 
distribution  function  P{q 0)  f°r  canonical  systems  is  given  by  (for  example  Raff  (200 1)) 

/’(<®f  =  Cexp^— (1) 

where  V  and  T  are  the  potential  and  the  temperature  respectively  for  configuration 
r/o,  K  is  the  Boltzmann  constant  and  C  is  a  proportionality  constant. 

A  subset  of  m  particles  from  a  total  of  N  particles  is  now  selected  and  the 
coordinates  of  these  particles  are  randomly  varied.  Here,  m  is  very  small  compared 
with  N.  For  example,  in  MC  simulations,  m  is  typically  one  to  five  atoms  in  a  system 
consisting  of  several  thousand  atoms.  For  the  ft h  atom,  the  new  configuration  is 
obtained  using  the  following  equations  (Herrmann  1994): 

(*i)„ew  =  (*|) Old  +  (0-5  -  &|)AXV  (2) 

=  CKiJaid  +  (0,5  —  fea)Ay-  (3) 

UfJnew  =  fo)oid  +  (0*5  -  £a)Az,  (4) 

where  (x)new*  (y/)new  and  (z,)ncw  are  the  new  configurations  in  space  and  (x,)M, 
0’r)oid  and  (z/)oki  are  the  old  configurations  for  the  ith  atom.  Ax  Ay  and  Az  are 
the  maximum  step  sizes  along  the  X\  Y  and  Z  directions  respectively,  and  the 
are  randomly  chosen  numbers  in  the  interval  [0,  I], 

For  canonical  systems,  this  new  configuration  point  will  be  either  accepted  or 
rejected  depending  upon  the  change  in  the  Boltzmann  probability  given  by  equation 
(1).  The  move  is  accepted,  if 


where  is  the  random  number  generated  between  0  and  1  in  the  ith  Morkov  move 
and  AV  —  Fncw  —  However,  since  the  system  possesses  large  temperature 
gradients,  it  is  not  a  canonical  system  and  the  application  of  equation  (5)  is  not 
straightforward.  This  problem  is  addressed  in  §  3. 

By  choosing  m  and  the  maximum  step  size  to  be  very  large,  it  is  possible  to  sample 
all  regions  of  configuration  space  rapidly  for  all  particles  in  the  system.  Consequently, 
the  change  A  V  in  the  potential  will  usually  be  very  large  and  exp[~  V(q0)/KT(q^)\ 
very  small.  Consequently,  the  probability  of  acceptance  of  the  given  Metropolis 
move  will  be  very  low.  Thus,  a  great  deal  of  computational  time  is  wasted  because 
of  the  large  number  of  rejected  moves.  On  the  other  hand,  if  we  take  the  maximum 
step  size  to  be  very  small,  A  V  will  be  very  small  and  the  probability  of  acceptance  will 
be  very  high.  However,  it  will  take  many  accepted  moves  to  sample  the  configuration 
space  adequately  and  to  reach  convergence.  These  two  extremes  are,  therefore,  not 
beneficial.  The  values  of  Ax  Ay  and  Ar  should  be  chosen  such  that  the  acccptancc- 
to- reject  ion  ratio  ajr  is  in  the  range  0.7  ^  a/r  ^  1 .3.  In  this  investigation,  m  is  taken 
as  I  and  the  value  of  maximum  step  size  in  all  directions  as  0,2  A, 

Consideration  of  equation  {1}  shows  that  all  moves  leading  to  lower  potential  are 
accepted.  In  other  words,  if  AV  <  0,  the  move  is  accepted.  Thus,  the  system  even¬ 
tually  moves  to  the  minimum  potential.  On  the  other  hand,  if  the  change  in  the 
potential  is  positive,  that  is  the  potential  rises,  then  the  value  ofcxp[-A  V{q^)IKT{q^)\ 
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must  be  greater  than  the  random  number  generated  for  the  particular  Markov 
move  to  be  accepted.  The  move  is  accepted  with  a  probability  that  decays  exponen¬ 
tially  with  increasing  AK  If  the  move  is  rejected,  the  old  position  of  the  atom  is 
retained.  This  cycle  is  considered  as  one  Markov  move.  The  Markov  moves  are 
repeated  until  a  global  minimum  is  reached. 

Two  methods  are  generally  employed  in  selecting  a  particular  subset  of  m  parti¬ 
cles  to  be  moved  in  a  given  set.  In  the  first  method,  the  m  particles  in  the  subset  are 
selected  randomly  in  each  move  from  the  N  particles  in  the  system.  In  the  second 
method,  some  systematic  method  of  selection  is  used  that  ensures  that  all  particles 
are  moved  before  any  particle  is  moved  again.  For  an  infinite  number  of  Markov 
moves,  both  methods  become  equivalent  but,  for  a  finite  number,  one  method  may 
produce  more  rapid  convergence  than  the  other.  In  the  present  investigation,  one 
atom  is  chosen  randomly  at  any  time  and  is  given  a  Markov  move.  This  process  is 
repeated  until  the  local  potential  is  conserved. 

In  cutting,  a  considerable  amount  of  plastic  deformation  takes  place  in  the  shear 
zone  ahead  of  the  cutting  took  So,  the  atoms  in  the  workpiece  at  or  near  the  tool  in 
the  depth-of-cut  region  are  deformed  much  more  than  the  atoms  further  down  in 
the  workpiece,  which  remain  in  configurations  similar  to  the  bulk  material.  In  other 
words,  the  change  in  potential  of  the  atoms  at  or  near  the  tool  tip  in  the  workpiece  is 
more  than  for  the  atoms  further  down  in  the  workpiece.  The  workpiece  atoms  at  the 
depth-of-cut  region  arc  removed  in  the  form  of  a  chip  from  the  bulk  and  are  moved 
to  an  elevated  potential.  Hence,  a  larger  number  of  Markov  moves  are  required  to 
reach  convergence  for  the  atoms  in  the  cutting  region. 

A  MC  step  consists  of  two  parts.  In  the  first  part,  an  atom  is  chosen  from  the 
atoms  from  the  cutting  region  (zone  I),  as  shown  in  figure  3.  A  Markov  move  is  then 
executed  on  this  atom.  This  process  is  repeated  until  local  potential  converges  to 
a  minimum.  In  the  second  step,  an  atom  is  chosen  randomly  from  the  workpiece 
(zone  2),  including  those  interacting  with  the  tool  and  a  Markov  move  is  executed. 
This  entire  process  is  continued  until  the  overall  system  potential  is  minimized. 
At  this  point,  the  tool  is  advanced  by  a  small  increment  and  the  entire  process 
is  repeated  at  the  new  tool  position. 

The  empirical  potential  used  for  the  MC  simulation  is  a  pairwise  Morse  potential 
as  it  is  found  to  be  simple  and  more  suitable  for  fee  materials  (Komanduri  et  aL 
2001).  It  can  be  represented  as  (Girifalco  and  Weizer  1959) 

^(r)  =  D[l  —  exp[— a{r-re)]}2 


Figure  3.  Schematic  diagram  showing  two  zones  in  the  MC  simulation  of 

nanometric  cutting. 
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or 

(  D{exp[-2a{r-rc)]  -  2exp[-of{r  -  re)]|  for  r  s?  R*, 

=  \  '  r  (6) 

[  0  for  r  >  K  , 

where  r  is  the  distance  between  any  two  atoms  and  rc  is  the  equilibrium  distance,  D  is 
the  equilibrium  dissociation  energy,  R*  is  the  cut-off  radius  and  a  denotes  the  range 
of  interactions.  The  three  parameters  rc,  D  and  a  are  obtained  from  the  closest 
spacing  between  the  atoms  (equilibrium  lattice  spacing),  the  Debye  temperature 
and  the  sublimation  energy,  respectively.  The  cut-off  radius  is  adjusted  so  that 
V<R=R'}/V(r^  }  =  Q.04,  The  accuracy  of  the  function  as  well  as  the  stability  of  the 
crystal  for  a  given  material  are  checked  for  various  properties,  including  the  cohesive 
energy,  the  lattice  constant,  the  compressibility  and  the  elastic  constants  as  well 
as  the  equation  of  state  and  the  stability  of  the  crystal.  The  potential  of  the  entire 
tool  workpiece  system  is  the  pairwise  summation  of  potential  terms  between  the 
lattice  atoms  of  the  workpiece  and  a  second  summation  of  pairwise  Morse  potentials 
between  the  atoms  of  the  tool  and  the  atoms  of  the  workpiece.  It  should  be  noted 
that,  in  this  investigation,  we  are  not  primarily  concerned  with  reproducing  the 
experimental  potential.  The  focus  is  on  the  development  of  MC  methods  for  machin¬ 
ing  applications  at  practical  cutting  speeds. 

§3,  Application  of  the  thermal  model  ok  metal 

CUTTING  TO  NANOMETRIC  CUTTING 

In  metal  cutting  with  a  sharp  tool*  there  are  two  principal  heat  sources,  namely 
the  shear  plane  heat  source  at  the  primary  shear  zone  and  the  frictional  heat  source 
at  the  tool -chip  interface.  In  this  investigation,  the  thermal  model  of  conventional 
metal  cutting  developed  by  Komanduri  and  Hou  (2000,  2001a,  b)  on  a  macroscale  is 
adopted  to  estimate  the  temperature  in  the  cutting  region  in  nanometric  cutting. 

Komanduri  and  Hou  (2000)  first  considered  the  temperature  distribution  in  the 
chip  and  the  work  material  caused  by  the  shear  plane  heat  source  in  machining  using 
Hahn’s  (1951)  model  of  a  band  heat  source  moving  obliquely  in  an  infinite  medium, 
Komanduri  and  Hou  (2001a)  then  considered  the  frictional  heat  source  at  the 
chip-tool  interface,  based  on  the  work  of  Chao  and  Trigger  (1955)  that  considers 
non-uniform  distribution  of  the  heat  partition  fraction  along  the  tool-chip  interface 
for  the  purpose  of  matching  the  temperature  distribution  on  the  chip  side  and  the 
tool  side.  Komanduri  and  Hou  (2001a)  used  the  functional  analysis  approach,  first 
proposed  by  Chao  and  Trigger  (1955),  to  address  the  variable  heat  partition  prob¬ 
lem,  Komanduri  and  Hou  (2001b)  then  combined  the  elTect  of  shear  plane  heat 
source  and  tool-chip  interface  frictional  heat  source  to  determine  the  temperature 
distribution  in  the  workpiece,  the  chip  and  the  tool.  As  this  model  is  incorporated  in 
the  present  investigation,  it  will  be  briefly  presented  in  the  following. 

3.1 .  Combined  effect  of  shear  plane  heat  source  and  tool-chip 
interface  frictional  heat  source 

Figure  4  is  a  schematic  diagram  of  the  heat  transfer  model  showing  the  two 
principal  heat  sources,  namely  the  shear  plane  heat  source  AB  and  the  chip  tool 
interface  frictional  heat  source  OA  operating  simultaneously  on  a  common  coordi¬ 
nate  system.  By  considering  that  part  of  the  heat  flow  from  the  shear  plane  through 
the  chip  and  the  tool-chip  interface  into  the  tool  to  be  continuous,  Komanduri  and 
Hou  (2001b)  found  the  effects  of  the  shear  plane  heat  source  on  the  temperature  rise 
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Figure  4  Schematic  diagram  of  the  heat  transfer  model  with  a  common  coordinate  system 
for  the  combined  effect  of  two  principal  heat  sources,  namely  the  heat  source  in 
the  primary  shear  plane  and  the  frictional  heat  source  at  the  chip-tool  interface 
(Komanduri  and  Hou  2001  b)r 


at  the  tool-chip  interface  on  the  chip  side  as  well  as  on  the  tool  side  to  be  the  same. 
Thus,  the  contribution  of  the  shear  plane  heat  source  on  the  temperature  rise  at  the 
tool  chip  interface  was  included  in  the  total  temperature  rise  for  both  sides. 

Referring  to  figure  4,  both  the  shear  plane  heat  source  and  the  tool-chip 
interface  frictional  heat  source  move  relative  to  the  chip  at  the  chip  velocity  vch 
but  in  the  opposite  direction  to  the  chip  flow.  As  the  entire  shear  plane  heat 
source  is  under  the  upper  boundary  surface  of  the  chip  except  for  point  B  (which 
is  at  the  boundary  surface  all  the  time),  an  image  heat  source  A'B  was  consid¬ 
ered.  Komanduri  and  Hou  (2001b)  used  an  oblique  moving  band  heat  source  for 
an  infinite  medium  for  both  the  primary  shear  plane  heal  source  and  its  image 
heat  source.  Similarly,  the  total  temperature  rise  at  any  point  in  the  tool  caused 
by  the  two  principal  heat  sources  also  consists  of  two  parts.  The  first  is  the 
frictional  heat  source  at  the  tool-chip  interface  from  the  tool  side  and  the  second 
is  due  to  the  shear  plane  heat  source.  However,  a  different  heat  transfer  model  is 
needed  to  address  this  problem.  They  considered  that  part  of  the  heat  coming 
from  the  shear  plane  heat  source  through  the  chip  and  the  tool-chip  interface 
into  the  tool,  acting  as  a  stationary  heat  source  located  at  the  tool  chip  interface. 
Thus,  the  heat  source  in  this  case  is  considered  as  an  induced  stationary  rectan¬ 
gular  heat  source  caused  by  the  shear  plane  heat  source.  The  analysis  of  thermal 
modelling  developed  by  Komanduri  and  Hou  (2000,  2001a,  b)  is  extensive  and 
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somewhat  involved.  Hence,  here  only  the  final  results  of  that  investigation  arc 
given,  For  details,  the  reader  is  referred  to  those  references. 

The  temperature  rise  due  to  frictional  heat  source  at  the  tool-chip  interface  is 
given  by  (Komanduri  and  Hou  2001b) 

'-(X-lih 
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The  temperature  rise  at  any  point  in  the  chip,  including  all  points  along  the 
tool-chip  interface,  caused  by  the  shear  plane  heat  source  is  given  by  (Komanduri 
and  Hou  2001b) 

Unit 


+  Ko(^[(X  -  ^i)2  +  (2/eh  -  2  “  zf)]'/‘)  d“V 


(8) 


The  total  rise  in  the  temperature  at  any  point  in  the  workpiece  is  given  by 
Komanduri  and  Hou  (2001b), 
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(9) 


Komanduri  and  Hou  (2001b)  applied  the  thermal  model  to  two  cases  of  metal 
cutting  at  the  macroscopic  level,  namely  conventional  machining  of  steel  with  a 
carbide  tool  at  a  high  Peclet  number  (jVPe«5-20)  (Chao  and  Trigger  1953)  and 
ultra-precision  machining  of  aluminium  with  a  single-crystal  diamond  tool  at  a 
low  Peclet  number  (AfPe«0.5)  (Ueda  et  al.  1998)  and  found  good  agreement  between 
the  analytical  and  the  experimental  results. 


3.2.  Application  of  the  thermal  model  of  metal  catting  to  Monte  Carlo 
simulation  of  nanometric  cutting 

In  order  to  apply  MC  methods  to  the  non-canonical  systems  present  in  machin¬ 
ing  applications,  we  assume  that  a  local  temperature  can  be  meaningfully  defined  for 
ensembles  of  about  lOOatoms.  This  assumption  permits  us  to  replace  the  tempera¬ 
ture  T  in  equation  (5)  with  one  that  is  dependent  upon  the  spatial  coordinates  of  the 
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atoms  being  moved  in  each  Markov  step.  That  is,  we  replace  Tin  equation  (5)  with 
a  spatially  dependent  temperature  T(^0)  that  is  computed  using  equation  (9), 

As  pointed  out,  the  thermal  model  developed  by  Komanduri  and  Hou 
(2001b)  was  verified  with  the  experimental  results  reported  in  the  literature  at  the 
macroscopic  level.  In  this  investigation,  this  model  is  applied  for  the  first  time  to 
nanometric  cutting.  To  check  the  viability  of  this  model,  MD  simulation  of 
nanometric  cutting  of  single-crystal  aluminium  was  conducted  with  10°  and  30* 
rake  tools  at  a  cutting  speed  of  500  ms^1  using  the  pairwise  potential  previously 
described  and  the  results  compared  with  the  thermal  analysis  of  the  metal-cutting 
process  on  a  nanoscale. 

For  estimating  the  temperature  in  the  MD  simulation,  a  small  region  ahead  of 
the  tool  in  the  cutting  zone  (figure  5)  was  selected  and  observed  continuously  during 
the  entire  simulation.  The  width  of  this  region  is  the  same  as  the  width  of  the 
workpiece.  The  temperature  in  this  region  was  calculated  at  each  tool  movement 
by  assuming  an  equipartitioning  of  the  total  energy  between  kinetic  and  potential 
energy  as  would  be  present  for  an  ensemble  of  independent  harmonic  oscillations 
(figure  6),  This  assumption  leads  to  a  temperature  T  in  the  cutting  region  equal  to 


T  =  \'  1  m*v< 


(10) 


where  P  is  the  total  number  of  atoms  falling  in  the  region  considered,  is  the  mass 
of  the  atom  and  v,  is  the  velocity  of  the  ith  atom  in  the  same  region. 

The  size  of  the  region  is  chosen  so  as  to  ensure  that  there  is  a  sufficient  number  of 
atoms  in  the  region  to  obtain  a  good  statistical  average  value  of  the  temperature. 
Usually,  some  50- 1 00  atoms  are  taken  for  P. 

The  cutting  and  thrust  forces  are  obtained  from  the  MD  simulation  of  nano- 
metric  cutting.  Figures  7(a)  and  (b)  show  the  variations  in  cutting  and  thrust  forces, 
respectively,  with  cutting  time  for  nanometric  cutting  with  a  10°  rake  angle  tool 
and  figures  8(a)  and  (b)  show  the  corresponding  variations  for  a  30°  rake  angle 
tool.  These  forces  are  then  resolved  along  the  shear  plane  and  the  tool  face  using 
Merchant's  (1945)  force  circle  diagram  (see  figure  9  for  details)  to  determine  the 


Figure  5.  Schematic  diagram  showing  the  region  of  observation  in  the  cutting  zone  for 
temperature  estimation  in  nanometric  cutting. 
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Figure  6.  Comparison  of  the  Morse  potential  and  harmonic  oscillation. 


intensity  of  the  heat  sources  at  the  shear  plane  and  the  tool-chip  frictional  interface. 
The  resolved  forces  are  then  used  to  calculate  the  heat  generated  at  the  shear  plane 
and  at  the  tool-chip  interface,  which  are  the  quantities  required  by  equations  (9) 
and  (13). 

The  heat  generated  at  the  chip- tool  interface  is. given  by 

„  _  ^frvch  T  1  t\t\ 

^  —  J  cm  s  *  (I  l  J 


Similarly,  the  heat  generated  at  the  shear  plane  is  given  by 


_  (Fevc  ^ff*'ch) 
/pls  “  ABxw 


Jem  1  s  1 


(12) 


where  Ffr  is  the  frictional  force  acting  along  the  chip-tool  surface,  Fc  is  the  cutting 
force,  L  is  the  length  of  frictional  heat  source,  vCh  is  the  chip  velocity,  w  is  the  width 
of  cut  and  AB  is  the  length  of  the  shear  heat  source  (f/sin^). 

In  the  MD  simulation,  the  tool  is  considered  infinitely  hard.  So,  the  atoms  do  not 
vibrate  at  their  respective  equilibrium  positions;  that  is,  there  is  no  molecular  kinetic 
energy  in  the  tool  atoms,  which  is  reflected  in  the  fact  that  the  temperature  rise  in  the 
tool  is  zero  throughout  the  entire  simulation.  In  other  words,  when  we  apply  the 
thermal  model,  the  tool  does  not  share  the  heat  generated  at  the  chip-tool  interface 
and  as  a  first  approximation,  the  chip  carries  away  all  the  heat.  Therefore,  equation 
(9)  is  simplified  by  setting  AB  —  0  and  Bchip  =  1.0.  Thus, 
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Figure  7.  Variations  in  (a)  cutting  and  (b)  thrust  forces  (moving-average  values)  with  cutting 
lime  in  MD  simulation  of  nanomctric  cutting  of  a  single-crystal,  aluminium  workpiece 
(rake  angle,  10°;  cutting  velocity,  500ms"1;  depth  of  cut,  5.1  A;  width  of  cut,  43A). 
In  this  and  subsequent  figures,  t.u  indicates  time  units. 


§4.  Results  and  discussion 

The  main  objective  of  this  investigation  is  to  conduct  comparison  studies  of  MC 
simulations  of  nanometric  cutting  of  a  single-crystal  aluminium  work  material 
oriented  in  the  (001)  plane  and  cut  in  the  [100]  direction  with  three  positive  rake 
angle  tools,  namely  10°,  30°  and  45°,  at  a  cutting  speed  of  5ms-1  and  to  determine 
the  effect  of  rake  angle  on  the  cutting  and  thrust  forces,  force  ratio,  frictional  force 
on  the  loo!  face,  shear  force  on  the  shear  plane  and  specific  energy,  as  well  as 
the  subsurface  deformation  in  the  work  material. 


4. 1 .  Comparison  of  the  temperature  in  the  cutting  zone  by  thermal  analysis  and 
molecular  dynamics  simulation  of  nanometric  cutting  at  500  m  s~ ' 

The  temperature  in  the  cutting  region  was  calculated  using  equation  (10)  and 
compared  with  the  results  obtained  using  equation  (13).  Figures  10  and  1 1  show  the 
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Figure  8.  Variations  in  (a)  cutting  and  (b)  thrust  forces  (nioving-aveFagc  values)  with  cutting 
time  in  MD  simulation  of  nanometric  cutting  of  a  single-crystal  aluminium  workpiece 
(rake  angle,  30*;  cutting  velocity,  500 ms”1;  depth  of  cut,  5.1  A;  width  of  cut,  43  A) 


variation  in  the  temperature  in  the  cutting  zone  with  cutting  time  from  the  MD 
simulation  and  the  thermal  model  for  cutting  at  500m s”“  with  10°  and  30*  rake 
angle  tools,  respectively.  It  can  be  seen  that  the  agreement  is  very  good.  Once  the 
local  temperatures  in  the  cutting  region  are  known,  they  can  be  substituted  into 
equation  (5)  and  used  in  MC  calculations.  This  method  permits  the  effects  due  to 
cutting  speed  to  be  indirectly  introduced  into  MC  simulations  of  nanometric  cutting. 
This  procedure,  therefore,  permits  MC  simulation  of  nanometric  cutting  to  be 
conducted  at  conventional  cutting  speeds  of  about  Sms"1.  A  similar  simulation 
by  the  MD  method  would  take  an  inordinate  amount  of  time,  even  with  the  fastest 
computers  available. 

4.2.  Force  analysis  for  molecular  dynamics  and  Monte  Carlo  simulations  of  cutting 
In  a  cutting  experiment,  the  X,  Y  and  Z  components  of  the  cutting  force  are 
measured  at  the  tool  edge.  These  components  are  then  decomposed  into  cutting 
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Figure  9.  Merchant's  (1945)  force  circle  diagram  for  orthogonal  cutting. 
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Figure  10,  Comparison  of  the  temperature  generated  in  the  cutting  region  near  the  tool  lip. 
between  the  thermal  model  after  Komanduri  and  Hou  (2001b)  and  the  MD  simulation 
results  (rake  angle,  10°;  cutting  velocity,  500ms”1). 


force,  thrust  force,  shear  force  and  frictional  forces,  as  shown  in  figure  9.  in 
this  section,  we  examine  the  relationship  between  these  forces  and  those  normally 
computed  in  MD  and  MC  simulations. 

The  total  force  Ftotal  exerted  by  the  cutting  tool,  is  the  sum  of  the  static  forces 
that  are  present  owing  to  the  interactions  between  the  workpiece  atoms  and  the 
workpiece-tool  atoms  plus  the  inertial  terms  for  each  atom  due  to  acceleration 
effects*  The  static  forces  arc  given  by  the  sum  of  the  negative  gradients  of  the 
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Figure  i  I.  Comparison  of  the  temperature  generated  in  the  cutting  region  near  the  tool  tipT 
between  the  thermal  model  after  Komanduri  and  Hou  (2001  b)  and  the  MD  simulation 
results  (cutting  velocity,  500m s“’;  rake  angle,  30°), 


potential  in  the  direction  of  the  force  while  the  inertial  forces  are  given  by  the  sum  of 
the  time  derivatives  of  the  appropriate  momentum  components  of  each  of  the  atoms. 
Thus,  the  total  cutting  force  is  given  by 


jntolal 


(14) 


since  the  cutting  direction  is  taken  to  be  the  Y  axis  (see  figures  2  and  3).  The  first 
double  summation  in  equation  (14)  is  the  sum  of  all  static  forces  between  the  N 
workpiece  atoms.  The  second  double  summation  incorporates  the  static  forces 
between  the  N  workpiece  atoms  and  the  Ntool  atoms  in  the  cutting  tool.  The  final 
summation  includes  all  the  Y  components  of  the  inertial  forces  on  the  workpiece 
atoms.  Since  the  tool  atoms  are  fixed  in  position,  these  terms  are  absent  for  the  tool. 
Consequently,  the  static  forces  obtained  in  the  MD  simulations  are  given  by 


In  the  case  of  MC  calculations,  the  method  converges  the  atoms  to  the  meta¬ 
stable  equilibrium  positions  that  characterize  the  system  at  each  position  of  the  tool 
during  the  cutting  process.  In  these  equilibrium  positions,  the  sum  of  the  inertial 
terms  will  be  zero.  Therefore,  the  static  forces  obtained  in  the  MC  calculations  will 
be  given  by 
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Since  most  of  the  inertial  terms  will  have  the  same  sign  as  we  expect  to 

have 


Therefore,  the  static  forces  obtained  in  MC  studies  will  exceed  those  for  MD 
simulations  under  corresponding  conditions.  As  the  cutting  speed  decreases,  the 
sum  of  the  inertial  forces  will  also  decrease.  Consequently,  at  low'  cutting  speeds, 
we  expect  to  see  the  forces  computed  in  MD  simulations  approach  those  obtained  in 
the  corresponding  MC  calculations, 

43.  Comparison  between  molecular  dynamics  and  Monte  Carlo  simulations 
of  nanometric  cutting  at  500ms~  l 

43. 1.  Molecular  dynamics  simulation  of  nanometric  cutting 

MD  simulations  of  nanometric  cutting  of  single-crystal  aluminium  work  material 
were  conducted  in  the  (100)  plane  with  cutting  in  the  [001]  direction  at  a  cutting 
velocity  of  500  ms”®  with  a  10°  rake  tool.  Table  l  gives  the  workpiece  and  tool 
dimensions  as  well  as  the  Morse  potential  parameters  used  in  the  simulation. 

Figures  12  (a)— (d)  show  various  stages  of  MD  simulation  of  nanomctric 
cutting  at  500  ms"1,  In  the  initial  stages  of  cutting,  very  little  deformation  ahead 
of  the  tool  is  observed.  As  cutting  progresses,  plastic  deformation  in  the  shear  zone 
can  be  seen.  As  the  tool  proceeds  further,  dislocation  generation  and  propagation  in 
the  workpiece  as  well  as  subsurface  deformation  can  be  seen  (figure  12(d)), 

Figures  7(a)  and  (b)  show  the  variation  in  the  static  cutting  and  thrust  forces 
with  cutting  time  for  a  10°  rake  tool  at  500  ms"1  in  MD  simulations  of  nanometric 


Table  I.  Workpiece  and  tool  dimensions,  cutting  parameters  and  Morse  potential 
parameters  used  in  MD  and  MC  simulation  of  nanometric  cutting  of  single-crystal 
aluminium  at  a  cutting  velocity  of  500  ms”1. 


Work  material 
Crystal  structure 

Morse  potential  parameters 
for  aluminium 

Of 

r* 

D 


Single-crystal  aluminium 
Fee 


1, 16460  A-1 
3.253  A 
0.2703  eV 


Configuration 


Two-dimensional  or  orthogonal  cutting 


Work  material  dimension 


Tool  material 


1 1  a,  50rj  and  11  a  along  the  Y  and  Z 
directions  respectively,  where  a  is  the 
lattice  constant  of  the  workpiece 

Infinitely  hard  (tungsten) 


Tool  dimensions 


22 a,  I  On  and  10a  along  the  X Y  and  Z 
directions  respectively,  where  a  is  the 
lattice  constant  of  the  tool 


Depth  of  cut 


4.05-5.1  A 
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Figure  12,  MD  simulation  at  various  stages  of  nanometric  cutting  (rake  angle,  10°;  cutting 

velocity,  500ms-1). 


cutting.  The  temperature  in  the  cutting  zone  is  calculated  using  equation  (10)  as 
described  in  §3,2.  Figure  13  shows  the  variation  in  the  temperature  in  the  cutting 
region  with  cutting  time  (same  plot  as  in  figure  10).  From  this,  the  mean  temperature 
is  estimated  to  be  about  900  K  at  500ms-1  with  a  10°  rake  tool. 
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Figure  |4.  MC  simulation  at  various  stages  of  nanometric  cutting  of  aluminium  (rake  angle* 

10°;  cutting  velocity,  500 ms''1). 
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4,3,2,  Monte  Carlo  simulation  of  nanometrie  cutting  at  500 ms~* 

MC  simulations  of  nanometrie  cutting  were  conducted  on  a  single-crystal 
aluminium  work  material  at  the  same  cutting  speed  of  500 ms-1  with  a  10°  rake 
tool.  Figures  1 4 (a)— (d)  show  the  results  of  the  MC  simulation  at  various  stages  of 
nanometrie  cutting.  On  comparing  figures  12  and  14,  it  can  be  seen  that  the  nature  of 
deformation  ahead  of  the  tool  as  well  as  the  subsurface  deformation,  in  both  MC 
and  MD  simulations  are  very  simitar.  Figures  15(a)  and  (b)  show  the  variations  in 
the  static  cutting  and  thrust  forces  respectively  with  cutting  time  in  MC  simulation 
with  a  10°  rake  tool  at  a  cutting  speed  of  500ms"1.  After  each  tool  advancement, 
the  temperature  in  the  cutting  region  was  calculated  using  the  thermal  model  of 
Komanduri  and  Hou  (2001b),  Figure  16  shows  the  variation  in  temperature  in  the 
cutting  zone  with  cutting  time  in  MC  simulation.  A  mean  temperature  of  about 
1000  K  was  obtained  during  the  cutting  process.  This  is  in  good  accord  with  the 
MD  simulation  results  shown  in  figure  13. 

Table  2  gives  a  comparison  between  MD  and  MC  simulation  of  nanometrie 
cutting  of  aluminium  work  material  at  the  same  cutting  speed  of  500ms_t  with 
a  10°  rake  tool.  Comparison  of  figures  7(a)  and  (b)  with  figures  15(a)  and  (b) 
shows  that,  as  expected,  the  static  cutting  and  thrust  forces  are  higher  for  the  MC 
simulation  than  those  obtained  in  the  MD  calculations.  The  MC  static  thrust 
force  is  about  35%  higher  than  the  MD  result.  The  static  cutting  force  obtained 
from  the  MC  calculation  exceeds  the  MD  result  by  amounts  varying  up  to  120% 
at  various  times  during  the  simulation.  These  differences  are  attributed  here  to 
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Figure  15.  Variation  in  (a)  cutting  and  (b)  thrust  forces  with  cutting  time  in  MC  simulation 
of  aluminium  (rake  angle,  10°;  cutting  velocity.  50Oms“l), 

the  presence  of  a  net  inertial  force  term  in  the  MD  calculations  that  is  absent 
for  MC  simulations  (see  equations  (14)-(17))  and  to  the  larger  cut  depth 
employed  in  the  MC  calculations.  The  somewhat  large  magnitude  of  these  differ¬ 
ences  is  due  primarily  to  the  large  cutting  speed  of  500  ms^1,  which  produces  a 
very  large  net  inertial  term.  At  experimental  cutting  speeds  of  about  2ms~‘  with 
equal  depths  of  cut,  the  differences  are  expected  to  be  much  smaller.  The  larger 
static  forces  present  in  the  MC  calculations  will  also  lead  to  larger  specific  ener¬ 
gies  than  those  obtained  using  MD  methods. 

4.4.  Monte  Carlo  simulation  of  nanometric  cutting  at  a  cutting  speed  of  5  ms~l 
MC  simulations  of  nanomctric  cutting  were  conducted  on  a  single-crystal 
aluminium  work  material  oriented  in  the  (001)  plane  and  cut  in  the  [100]  direc¬ 
tion  using  three  positive  rake  angle  tools,  namely  10°,  30°  and  45&.  Table  3  gives 
the  computational  parameters  used  in  the  MC  simulation  of  nanomctric  cutting  at 
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Figure  16,  Variation  in  temperature  in  the  cutting  zone  with  cutting  time  in  MC  simulation 
of  nanometric  cutting  (rake  angle,  IGQ;  cutting  velocity,  500  m  s-1). 


Table  2.  Comparison  between  MD  and  MC  simulation  for  nanometric  cutting  of  aluminium 
at  500  ms^1  and  10°  rake  angle  (workpiece  dimensions  are  given  in  table  1). 


Cutting  conditions  and  output  parameters 

MD 

MC 

Cutting  force  per  unit  width  (nN  A-1) 

1 ,255 

2.53 

Thrust  force  per  unit  width  (nN  A“l) 

0,958 

1.17 

Width  of  cut  (A) 

40.5 

43.1 

Depth  of  cut  (A) 

4.1 

5.1 

Force  ratio 

0,764 

0.462 

Specific  energy  ( 105  N  mm*2) 

0.306 

0,496 

Computational  time  (h) 

8— i  0 

14-16 

Table  3.  Computational  parameters  used 

in  the  MC  simulation  of  nanometric  cutting  of 

single-crystal  aluminium  at  a  cutting  velocity  Sms  . 

Configuration 

Two-dimensional  or  orthogonal  cutting 

Work  material  dimensions 

5*i  x  25 a  x  1 5/j,  where 

a  is  the  lattice 

constant  of  the  workpiece 

Tool  dimensions 

20a  x  l  lo  x  1  la,  a  is  the  lattice 

constant  of  the  tool 

Depth  of  cut 

5.1  A 

5ms"1,  Figures  1 7 (a)— (d),  1 8 (a)— (d)  and  1 9 (a)— (d)  show  various  stages  of  the  MC 
simulations  with  10°,  3Ga  and  45°  rake  tools  respectively.  As  the  tool  proceeds  into 
the  workpiece,  we  observe  the  formation  of  a  primary  shear  zone  anti  some  subsur¬ 
face  deformation  in  the  machined  surface.  We  also  observe  elastic  recovery  of  the 
atoms  in  the  machined  surface  as  the  tool  moves  away  from  the  machined  surface 
just  generated  (figure  17(d)).  The  high  elastic  recovery  is  attributed  to  the  basic 
principle  underlying  the  MC  method,  namely,  that  the  system  attempts  to  reach 
minimum  potential.  The  same  reason  is  also  applicable  for  the  increased  crystalline 
structure  observed  in  the  chip  during  nanomctric  cutting  instead  of  the  highly 
deformed  structure  in  the  chip  in  conventional  cutting  and/or  in  MD  simulations. 
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Figure  17,  MC  simulation  at  various  stages  of  nanometric  cutting  (rake  angle.  10L); 

cutting  velocity,  5m$“r). 


Figure  18.  MC  simulation  at  various  stages  of  nanometric  cutting  (rake  angle,  30"; 

cuuing  velocity,  5ms”1). 
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Figure  19,  MC  Simula ti on  at  various  stages  of  nanometric  cutting  (rake  angle,  45*;  cutting 

velocity,  5ms"1)* 


0  ' 

Figures  20(a)  and  (b)  show  the  variations  in  the  cutting  and  the  thrust  forces 
respectively  with  cutting  time  for  three  positive  rake  angles,  namely  10°,  30°  and  45°, 
in  MC  simulations  at  a  cutting  speed  of  5  m  s-1.  It  can  be  seen  that  both  cutting  and 
thrust  forces  decrease  as  the  rake  angle  increases.  Figure  21  shows  the  variations  in 
mean  cutting  and  thrust  forces  per  unit  width  with  rake  angle  in  MC  simulations  at  a 
cutting  speed  of  5  ms'1.  It  can  be  seen  that  both  the  forces  decrease  with  increase  in 
rake  angle.  Figure  22  shows  the  variation  in  the  force  ratio  (the  ratio  of  the  thrust 
force  to  the  cutting  force)  with  rake  angle  in  MC  simulations  at  a  cutting  speed  of 
5  ms'1,  The  force  ratio  decreases  as  the  rake  angle  increases  from  10°  to  45°  as 
shown  in  figure  22  and  table  4. 

Figure  23  shows  the  variations  in  frictional  force  per  unit  width  on  the  tool  face 
and  the  shear  force  per  unit  width  on  the  shear  plane,  respectively,  with  rake  angle  in 
MC  simulations  at  a  cutting  speed  of  5ms'1,  It  can  be  seen  that  both  forces  decrease 
as  the  rake  angle  increases  (see  table  5  for  details).  Figure  24  shows  the  variation  in 
shear  angle  with  rake  angle  at  a  cutting  speed  of  5ms'1,  It  can  be  seen  that  shear 
angle  increases  with  increase  in  rake  angle,  This  is  because  the  chip  thickness 
decreases  with  increase  in  rake  angle,  as  can  be  seen  from  the  MC  simulations  of 
nanometric  cutting  with  different  rake  angle  tools  (figures  17-19).  Figure  25  shows 
the  variation  in  specific  energy  (energy  required  for  removing  unit  volume  of  work 
material)  with  rake  angle  in  MC  simulations  at  a  cutting  speed  of  5  ms-1.  It  can  be 
seen  that  the  specific  energy  also  decreases  as  the  rake  angle  increases  from  10°  to  45° 
(see  table  4  for  details). 
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Time  (t,u.)  x  10s 

(a) 


(b) 

Figure  20,  Variation  in  (a)  cutting  and  (b)  thrust  forces  with  cutting  time  in  M C  simulation 
of  nanomctric  cutting  for  different  rake  angles  at  a  cutting  speed  of  Sms'®. 


Figure  26  shows  the  variation  in  temperature  in  the  cutting  zone  with  culling 
time.  It  can  be  seen  that  temperature  in  the  cutting  zone  at  5  ms'1  is  about  400  K. 
This  is  in  contrast  with  the  temperature  in  the  cutting  zone  of  about  1000  K  at 
500  m  s_1* 


§5.  Conclusions 

(J)  MC  simulation  of  nanometric  cutting  was  conducted  for  the  first  time  at 
a  practical  cutting  speed  of  5ms-1.  The  MC  method  incorporates  a 
Markov  random  walk  and  the  Metropolis  acceptance -rejection  criterion 
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Figure  21.  Variation  in  cutting  force  per  unit  width  and  thrust  force  per  unit  width  with 
rake  angle  in  MC  simulation  of  nanometric  cutting  at  a  cutting  velocity  of  5  ms"1. 


adopted  for  use  with  temperature  gradients  to  determine  the  new  configura¬ 
tions  of  the  atoms. 

(2)  The  temperature  in  the  cutting  zone  in  MD  and  MC  simulations  of  nano¬ 
metric  cutting  was  estimated  using  the  thermal  model  of  the  cutting  process 
developed  by  Komanduri  and  Hou  (2001b)  on  the  macroscopic  scale. 
Reasonably  good  agreement  was  found  between  the  results  of  the  thermal 
model  and  MD  simulation  results  obtained  using  a  harmonic  oscillator 
energy  partitioning.  This  local  temperature  is  employed  in  the  Boltzmann 
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Table  4.  Results  of  MC  simulation  of  nanometric  cutting  of  aluminium  at  5  m  s  1  for  various 
rake  angles,  namely  10°,  30°  and  45°  (see  table  3  for  workpiece  dimensions). 


Rake  angle 
(deg) 

Cutting  force  per 
unit  width 
(N  mm-1) 

Thrust  force  per 
unit  width 
(N  mm"1) 

Force 

ratio 

Specific  energy 
(105Nmm J) 

10 

3.0253 

1.854 

0.613 

0.593 

30 

2.0885 

1.128 

0.540 

0.384 

45 

1.5556 

0.796 

0.512 

0.305 

Table  5.  Results  of  MC  simulation  of  nanometric  cutting  of  aluminium  al 
5  ms"1  for  various  rake  angles  (sec  table  3  for  workpiece  dimensions). 


Rake  angle 
(deg) 

Shear  force  per 
unit  width 
(N  mm"1) 

Shear  angle 
(deg) 

Frictional  force 
per  unit  width 
(N  mm-1) 

10 

2.46 

14.54 

3.30 

30 

1.48 

23.84 

2.25 

45 

0.84 

34.24 

1.67 

acccptance-rcjcction  criterion.  Thus,  cutting  speed  is  indirectly  incorporated 
into  the  MC  simulation  via  the  temperature  in  the  cutting  zone  since  cutting 
speed  and  cutting  velocity  are  related. 

(3)  A  comparison  of  MD  and  MC  simulations  of  nanometric  cutting  conducted 
at  a  cutting  velocity  of  500 ms-1  yielded  reasonably  good  agreement  under 
similar  cutting  conditions.  As  expected,  static  forces  obtained  from  MC 
calculations  exceed  the  corresponding  MD  results  due  to  the  presence  of 
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Figure  24.  Variation  in  the  shear  angle  with  rake  angle  in  MC  simulation  of  nanomclric  of 
aluminium  at  a  culling  speed  of  5  ms-1. 


inertial  forces  in  the  MD  simulations  and  to  a  longer  cut  depth  for  the  MC 
studies. 

(4)  MC  simulations  of  nanometric  cutting  were  conducted  on  a  single-crystal 
aluminium  work  material  in  the  (001)  plane  and  cut  in  the  [100]  direction 
using  three  positive  rake  angle  tools,  namely  10°,  30°  and  45°,  at  a  cutting 
velocity  of  5ms"1  to  investigate  the  effect  of  rake  angle  on  the  cutting  and 
thrust  forces,  force  ratio,  frictional  force  on  the  tool  face,  shear  force  on  the 
shear  plane,  shear  angle  and  specific  energy.  The  cutting  and  thrust  forces, 
the  force  ratio,  the  frictional  force  per  unit  width  on  the  tool  face,  the  shear 
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Figure  26.  Variation  in  temperature  in  the  cutting  zone  with  cutting  time  in  MC  simulation 
of  nanometric  cutting  (rake  angle,  10°;  cutting  velocity,  5ms”1). 


force  per  unit  width  on  the  shear  plane,  and  the  specific  energy  all  decreased 
as  the  rake  angle  increases,  as  found  in  practice.  Only  the  shear  angle,  as 
expected,  was  found  to  increase  with  increase  in  rake  angle  owing  to  the 
decrease  in  the  chip  thickness  with  increase  in  rake  angle.  These  variations 
with  rake  angle  are  consistent  with  practical  cutting  experience. 

(5)  The  temperature  in  the  cutting  zone  at  a  cumng  velocity  of  5ms"1  was 
estimated  to  be  about  400  K  with  a  10°  rake  tool  This  is  in  contrast  with 
the  temperature  in  the  cutting  zone  of  about  1000  K  with  a  \0°  rake  tool  at 
500m  s”1. 

(6)  Although  this  is  not  shown  expressly  in  the  present  paper  there  appears  to  be 
ample  scope  for  optimization  of  the  MC  simulation  of  nanometric  cutting  by 
combining  MC  simulation  with  the  DT  or  the  steepest-dcscent  technique  as 
well  as  with  the  use  of  massively  parallel  processors  to  reduce  significantly 
the  computational  time.  Work  in  this  area  is  currently  under  way  with  initial 
indications  of  some  promising  results. 
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Abstract;  In  order  to  reduce  the  computational  time,  Monte  Carlo  (MC)  simulations  of 
nano  metric  cutting  have  been  modified  to  include  a  combination  of  steepest  descent  (SD) 
and  Monte  Carlo  procedures.  This  MC-SD  combination  is  found  to  reduce  the  required 
computational  times  by  factors  of  at  least  two  to  three  over  those  achieved  using  MC  methods 
alone.  The  MC-SD  method  is  applied  to  the  nanometric  cutting  or  single-crystal  aluminium 
along  the  (100)  plane  with  different  rake  angle  tools  at  a  cutting  speed  of  5 m/s.  The  results 
obtained  from  the  MC-SD  calculations  are  found  to  be  almost  identical  to  those  resulting 
from  the  MC  simulations. 

Keywords;  nanoelectronics*  nanowires,  electronic  assemblies/devices 


1  INTRODUCTION 

Molecular  dynamics  (MD)  simulations  are  powerful 
tools  lor  modelling  Lhe  machining  process  at  nano- 
scale  [II.  However,  if  such  simulations  are  carried  out 
using  experimental  speeds  for  cutting,  grinding,  or 
tensile  pulling,  Lhe  required  computational  times 
become  prohibitively  large.  For  ibis  reason,  most  MD 
simulations  are  conducted  ai  high  cutting  speeds 
(100-500  mis).  The  major  part  of  this  computational 
time  is  devoted  to  numerical  integration  of  the  classi¬ 
cal  Newtonian  equations  of  motion  for  the  interacting 
particles  to  obtain  the  temporal  behaviour  of  the  work- 
piece  atoms.  If,  at  any  given  moment,  the  cutting 
process  is  stopped,  continued  integration  of  Lhe  equa¬ 
tions  of  motion  will  give  the  linal*  melas table  structure 
of  the  workpiece  atoms.  Moreover,  it  will  be  possible 
not  only  to  have  this  final  structure,  hut  also  detailed 
information  concerning  the  mechanism  by  which  the 
individual  atoms  attain  that  structure.  In  many  cases, 
it  is  only  the  final  metasiable  configurations  that  are 
of  interest.  When  this  is  the  situation,  MD  calculations 
provide  far  more  information  than  that  is  required, 
and  that  information  is  obtained  at  the  expense  of 
extremely  large  quantities  of  computational  lime. 
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Oklahoma  State  University.  2  IS  Engineering  North .  St  ill  Ufa  ter 
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In  many  applications,  the  above  problems  have 
been  avoided  by  employing  Monte  Carlo  (MC) 
methods.  Such  procedures  allow  the  metasiable  or 
equilibrium  states  for  the  system  of  interest  to  be 
determined  without  the  necessity  of  solving  large 
sets  of  coupled  differential  equations  dial  provide 
unnecessary  information  about  the  dynamics 
through  which  these  stales  are  attained.  In  the 
large  majority  of  cases.  MC  methods  are  applied 
to  canonical  systems  that  are  characterized  by 
phase-space  distributions  that  are  Boltzmann  so 
that  the  system  lias  a  well-defined  temperature 
1 2,  3|.  MC  procedures  have  also  been  successfully 
applied  to  microcanonical  systems  in  which  the 
total  system  energy'  is  fixed  and  all  system  config¬ 
urations  that  have  a  given  total  energy  have  equal 
probability  of  existence.  Nyman,  et  ai  [41  and 
Severity  et  ai  (5|  have  developed  procedures  for 
executing  efficient  sampling  of  such  microcanonical 
systems.  These  methods  have  been  employed  by 
Schranz,  et  ai  [6|  in  studying  simple  bond  scission 
reactions  of  1 ,2-difluoroetbane  [7]  and  both  silylene 
and  disilane  |8|. 

The  principal  problem  dial  must  be  addressed  to 
develop  an  MC  applicable  to  cutting  experiments  is 
to  find  a  procedure  that  will  incorporate  the  effects 
of  cutting  speed.  Since  there  is  no  time  variable  in 
an  MC  calculation,  it  is  not  immediately  obvious 
how  this  might  he  done  for  systems  that  are  neither 
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canonical  nor  microcanon ical.  This  can  be  accom¬ 
plished  |9,  10]  by  defining  a  local  temperature  in 
die  cutting  zone  using  Lhe  thermal  analysis  devel¬ 
oped  by  Komanduri  and  Hou  J 11-13)  lor  conven¬ 
tional  machining.  Extension  of  this  method  to  the 
nanometric  regime  permits  an  accurate  estimate  of 
the  local  temperature  in  cutting  as  a  function  of 
the  spatial  coordinates  of  the  workpiece.  This 
temperature  is  l hen  employed  in  lhe  Boltzmann 
probability  distribution  function  that  is  used  to 
determine  the  acceptance-rejection  ratio  of  the  MC 
moves  in  the  simulation.  Since  cutting  speed  is 
closely  related  to  cutting  temperature*  the  cutting 
speed  enters  the  calculation  via  the  thermal  analysis 
equations. 

A l  cutting  speeds  around  500  m/s,  it  was  found 
that  MC  simulations  for  cutting  of  an  aluminium 
workpiece  with  a  10  rake  angle  require  somewhat 
more  time  than  the  corresponding  MD  calculations 
[9,  10!.  This  is  due  to  the  very  large  number  of 
MC  moves  that  must  he  executed  to  converge  to 
the  desired  metastable  structures  throughout  the 
simulation.  However,  as  the  cutting  speed 
decreases,  MD  calculations  become  increasingly 
demanding  of  computational  time  whereas  MC 
methods  are  nearly  independent  of  cutting  speed. 
Therefore,  at  low  cutting  speeds,  MC  calculations 
require  two  orders  of  magnitude  less  time  than 
MD  simulations. 

In  ibis  paper,  an  improved  MC  method  is  presented 
that  decreases  the  required  computational  time  by  at 
least  a  factor  of  two  to  three  depending  upon  the 
experimental  conditions,  die  number  of  atoms  in 
the  model,  and  -  presumably  -  upon  the  nature  of 
the  interaction  potential  employed.  The  modification 
is  based  on  the  Tact  that  MC  methods  always  drive  the 
configuration  of  the  system  toward  a  potential  mini- 
mum.  Therefore,  the  number  of  MC  moves  required 
to  attain  this  minimum  can  be  reduced  by  using  a 
damped  trajectory  (DT)  procedure  |14|  or  steepest 
descent  (SD)  method  to  com  pule  a  configuration 
dose  to  the  potential  minimum  before  executing 
MC  moves.  Because  DT  and  SD  methods  converge 
to  the  nearest  local  potential  minimum,  the  above 
methods  alone  will  usually  not  correctly  determine 
the  final  metastable  configuration  attained  by  the 
system  at  a  given  point  during  a  simulation*  To 
circumvent  this  problem,  a  series  of  MC  moves 
obtained  from  a  Markov  chain  [2,  3]  are  executed 
after  completion  of  the  DT  or  SD  calculation.  By 
alternating  MC  moves  and  either  SD  or  DT  calcula¬ 
tions,  the  system  rapidly  converges  to  the  desired 
metastabie  configuration.  Tor  the  systems  investi¬ 
gated  in  ill  is  research,  it  was  found  that  the  MC-SD 
computational  limes  are  reduced  by  factors  ranging 
between  iwo  in  three  over  those  required  for  purely 
MC  simulations. 


2  BRIEF  REVIEW  OF  LITERATURE 

MD  simulations  were  initiated  in  the  late  1950s  at  the 
Lawrence  Radiation  Laboratory  (LRLO  in  the  United 
States  by  Alder  and  Wainwright  [15]  in  the  field  of 
statistical  mechanics.  Since  then,  MD  simulations 
have  been  extensively  applied  to  many  diverse 
fields,  including,  crystal  growth  [161,  indentation 
(171,  tribology  [18!,  low-pressure  diamond  synthesis 
j  1 9-24 \ ,  and  laser  interactions  |25],  to  name  a  Few. 
MD  simulation  of  nanometric  cutting  was  pioneered 
by  Belak.  Boercker,  and  Stowers  |26,  27)  at  the 
Lawrence  Livermore  National  Laboratory  (LLNL). 
This  work  was  followed  by  some  research  groups  in 
Japan  [26-35]  and  in  the  US  by  the  authors  of  Lhe 
present  paper  at  Oklahoma  State  University  [36]. 
For  details,  the  readers  are  referred  to  an  extensive 
review  article  on  this  subject  by  the  authors  contain¬ 
ing  some  124  references  [1!. 

MC  methods  utilize  random  procedures  that  allow 
simple  solutions  to  be  obtained  for  physical  problems 
of  a  complex  nature  ]37,  3B|.  The  term  ‘Monte  Carlo’ 
is  used  in  connection  with  the  probability  sampling 
techniques  that  approximate  the  solutions  of  a  wide 
range  of  mathematical  and  physical  problems. 
Metropolis  et  at.  [39]  of  the  Los  Alamos  National 
Laboratory  in  New  Mexico  were  the  first  to  introduce 
the  MC  method  for  the  study  of  a  system  of  inter¬ 
acting  particles  in  1953,  Since  then  it  has  been  used 
extensively  in  physics,  chemistry,  economics,  weather 
forecasting*  and  financial  transactions,  to  name  a  few. 
The  literature  on  these  topics  is  -  quite  extensive, 
including  several  textbooks  [37,  38,  40,  41  J.  However, 
virtually  all  of  these  are  on  systems  that  are  either 
canonical  or  microcanonical.  in  these  cases,  there 
are  usually  no  external  forces  present  and  the 
system  energy  is  conserved. 

There  are,  however,  very  few,  if  any,  MC  papers 
dealing  with  such  systems  as  culling  where  there 
are  large  external  Forces  that  cause  the  system 
energy  to  grow  rapidly  in  such  a  manner  that  the 
system  is  neither  canonical  nor  microcanonical.  The 
present  authors  were,  perhaps,  the  first  to  apply  MC 
simulations  lo  nanometric  cutting  19 j .  This  approach 
enabled  simulation  of  the  cutting  process  at  conven¬ 
tional  speeds  (5  m/sj  instead  of  extremely  high  speeds 
(100-500  m/s)  used  in  conventional  MD  simulations. 
As  already  pointed  out,  this  is  accomplished  by  defin¬ 
ing  a  local  temperature  in  the  cutting  zone  using  the 
thermal  analysis  developed  by  Komanduri  and  llou 
1 11-131.  This  temperature  was  then  employed  in  the 
Boltzmann  probability  distribution  that  is  used  to 
determine  the  acceptance-rejection  ratio  of  the 
Monte  Carlo  moves  in  the  simulations.  Since  cutting 
speed  is  closely  related  to  cutting  temperature,  the 
cutLing  speed  thus  enters  lhe  calculation  via  lhe 
thermal  analysis  equations.  The  present  investigation 
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is  a  follow  up  of  that  work  with  ihe  objective  of 
reducing  the  computational  time  by  using  a  combina¬ 
tion  of  SD  and  MC  procedures. 


3  LOCAL  MINIMUM  METHODS 

3. 1  Damped  trajectories  (DTs) 

In  a  DT  procedure,  the  kinetic  energy  of  all  the  Janice 
atoms  is  set  to  zero  and  the  i  Jamiltonian  equations  of 
motion  for  these  atoms  are  integrated  to  a  point  at 
which  the  total  potential  energy  attains  a  minimum. 
This  is  defined  as  one  DT  cycle.  Once  the  system 
passes  through  a  potential  minimum,  the  total  kinetic 
energy  is  reset  to  zero  and  subsequent  cycles  are 
executed  until  convergence  to  a  local  minimum 
potential  is  reached.  Since  the  system  potential  is 
independent  of  the  nuclear  masses,  the  time  required 
for  the  execution  of  the  DT  procedure  can  be 
decreased  by  setting  the  mass  of  all  the  workpiece 
atoms  equal  to  that  or  hydrogen.  Moreover,  since 
energy  conservation  requirements  are  of  no  concern, 
larger  integration  step  sizes  may  be  employed. 
Komanduri,  el  ai  |I4]  previously  employed  a 
combination  of  M C  and  DT  procedures  in  their 
studies  of  uniaxial  tension  of  face-centred  cubic 
(FCQ  metals.  However,  when  the  meiastable  struc¬ 
tures  become  very  amorphous  as  is  the  case  in  cutting 
simulations,  the  convergence  rate  For  damped 
trajectories  becomes  very  slow.  In  such  cases.  SD 
methods  were  found  to  offer  superior  computational 
speeds. 


3.2  Steepest  descent 

A  SD  method  makes  use  of  the  fact  that  the  gradient 
of  a  hypersurface  is  a  vector,  the  direction  of  which 
corresponds  to  the  direction  of  the  maximum  rate 
of  increase  of  the  value  of  the  hypersurface  142-441. 
Thus,  a  move  is  made  in  the  direction  opposite  to 
the  surface  gradient,  that  will  be  a  movement  towards 
the  surface  minimum.  The  method  can  be  repre¬ 
sented  by 

K,+ ,  =  K-trfV  (l) 

where,  a  is  the  step  size  in  the  direction  of  the 
gradient  and  VF  is  the  surface  gradient  at  point  /, 
As  n  increases,  the  convergence  rate  to  the  potential 
minimum  increases,  but  the  resolution  of  the  precise 
position  of  the  minimum  decreases.  One  SD  cycle  is 
defined  as  being  one  movement  or  every  atom  of 
the  workpiece  material  according  to  equation  (1). 
Figure  l  shows  a  flow  diagram  for  the  execution  of 
one  SD  cycle.  Convergence  is  monitored  by  examina¬ 
tion  or  the  magnitude  of  change  of  the  total  potential 
in  successive  SD  cycles. 


if  i  <  total  atoms 


Fig.  I  Flow  diagram  For  the  execution  of  one  SD  cycle 


In  the  present  application,  the  workpiece  and 
workpiece-tool  interaction  potentials  are  assumed 
to  be  a  summation  of  pairwise  Morse  functions,  i.e. 


Xw* 


^Tuini  =  51 5Z  fw(r#) +  ^2  5Z  ^(ry) 


(2) 


i  \  )  >  i 


i  I  /  1 


In  equation  (2),  Vy}p (n,)  and  are  the  pairwise 

interaction  potentials  between  workpiece  atoms  and 
between  workpiece-tool  atoms,  respectively.  Nwi* 
and  /VT  are  the  number  of  atoms  in  the  workpiece 
and  tool,  respectively.  The  pairwise  terms  are  given  by 


f"wi >W)  =  £wp{  exp[-2«|  {r,t  -  »f)| 

-  2  exp[-«,  (r„  -  rj®)] }  for  r„  sS  rWp 


f\vt>('7,)  =  0  fur  r„  >  rWp 
K'(ru)  =  0i.  { exp(-2r*2(r„  -  r?')] 

-2exp|-rt;{r(/  -  rf)]  I  for  j£  rjj 


and 

^lV#)  =  o  for  r„  >  li  (1) 

In  equation  (3),  r'w,.  and  r\  are  the  cut-off  radii  for 
workpiece-workpiece  and  workpiece- tool  inter¬ 
actions,  respectively.  The  gradients  required  for  the 
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SD  cycles  are  easily  computed  using  equations  (2) 
and  (3)  combined  with  the  chain  rule  lo  convert  ihe 
derivatives  to  Cartesian  coordinates. 


4  MC-SD  SIMULATION  MODEL 

The  MC-SD  method  combines  MC  and  SD  caleula- 
lions  in  a  manner  designed  to  achieve  rapid  conver¬ 
gence  to  the  equilibrium  metastable  structures  that 
characterize  various  stages  of  ihe  cutting  process. 
By  alternating  sets  of  MC  moves  with  SD  cycles,  ihe 
desired  metastable  structures  can  be  rapidly  deter- 
mined,  without  the  necessity  to  integrate  thousands 
of  differential  equations.  Consequently,  the  need  to 
compute  millions  of  potential-surface  derivatives 
whose  functional  forms  are  generally  much  more 
complex  than  the  potential  itself  can  be  avoided.  In 
addition,  the  MC  calculations  require  the  change  in 
potential  rather  than  Lhe  potential  itself.  Therefore, 
those  potential  terms  that  change  upon  execution  of 
Markov  step  need  lo  be  considered.  This  will  be  a 
small  subset  of  the  total  set  of  terms  comprising 
the  total  potential.  As  a  result,  the  computational 
time  required  for  the  combined  MC-SD  procedure 
is  greatly  reduced  over  that  needed  for  an  MD 
method. 

Figure  2  shows  a  schematic  model  used  for  MC  and 
MC-SD  cutting  simulations  of  a  single-crystal  alumi¬ 
nium  workpiece.  The  workpiece  is  divided  in  two 
zones,  a  moving  zone  and  a  boundary  or  stationary 
zone.  Since  nature  always  drives  any  system  towards 
its  most  stable  configuration  or  minimum  potential, 
in  MC  simulations  configurations  are  randomly 
selected  that  move  the  system  towards  this  configura¬ 
tion.  In  most  cases,  the  probability  density  function 
for  configuration  points  is  unknown  so  that  this 
selection  cannot  be  accomplished  in  a  straight¬ 
forward  fashion.  A  common  method  to  handle  Lhis 
type  of  problem  involves  the  generation  of  random 


configurations  using  Markov  chains  generated  by 
the  Metropolis  sampling  procedure  |2,  3]. 

Consider  an  initial  configuration  </{>  for  an  ,V-particle 
system.  The  Boltzmann  distribution  function  P(qa)  for 
canonical  systems  is  given  by 


f’tad  =  Cexp 


iM 

kT 


<4) 


where  V  and  T  are  the  potential  and  temperature, 
respectively,  for  configuration  r/u.  A  is  the  Boltzmann 
constant  and  C  is,  in  effect,  a  normalization  constant. 
A  subset  of  m  particles  from  a  total  of  N  particles  is 
now  selected  and  the  coordinates  of  these  particles 
are  randomly  varied.  Here,  m  is  very  small  compared 
with  N.  For  the  ilh  atom,  the  new  configuration  is 
obtained  using  the  following  equal  ions 


,\f"'  =  xf  +  (0.5  -  £d)A.v 

(?) 

yr  =  yf  +  (0.5  -  fo)Ar 

(6) 

=T  =  =f'  +  (0.5-e3)Ar 

0) 

where  .vf'",  #ifrtr,  and  z?'"  are  the  new  configuration 
coordinates  for  panicle  /  and  .v^,  yff,  and  -;j/^  are 
those  for  the  old  configuration.  A.v,  Ay,  and  Ar  are 
the  maximum  step  sizes  along  ihe  .V,  Y,  and  Z 
directions,  respectively,  and  the  £,  are  randomly 
chosen  numbers  in  the  interval  |0.  I|. 

For  canning)!  systems,  this  new  configuration  point 
will  be  either  accepted  or  rejected  depending  upon 
the  change  in  the  Boltzmann  probability  given  by 
equation  (4).  The  move  is  accepted,  if 

exp  (“Jf )  ?  £  (8) 


where  £  is  a  random  number  in  the  interval  [0, 1 1  and 
AV  -  Vllfxr  -  Vm*  If  the  move  is  rejected,  the  con¬ 
figuration  q0  is  returned  to  and  the  procedure  is 
repeated  until  a  move  is  accepted.  More  details  of 


Boundary'  atoms 


O 

O 


Moving  atoms 


Fig.  2  Schematic  model  used  for  MC  and  MC-SD  cutting  simulations  of  a  single-crystal  aluminium 
workpiece 
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i tie  procedure  are  provided  in  Komanduri*  et  at.  |9|, 
The  principal  problem  encountered  when  applying 
MC  methods  10  cutting  simulations  is  that  the  system 
is  neither  canonical  nor  microca  ironical,  Large 
temperature  gradients  are  present  and  hence,  the 
temperature  appearing  in  equation  (8)  is  undefined. 
In  the  initial  work  of  Komanduri,  et  at.  |9L  this  prob¬ 
lem  was  addressed  by  defining  a  local  temperature 
and  then  employing  the  thermal  analysis  equations 
developed  by  Komanduri  and  Uou  f  1 1-13]  to  obtain 
this  local  temperature  as  a  function  of  the  spatial 
coordinates  of  the  atoms  on  which  Markov  moves 
are  being  executed*  This  thermal  analysis  considers 
the  combined  effect  of  two  principal  heat  sources, 
the  shear  plane  and  the  tool-chip  interface  frictional 
heal  sources.  Since  one  of  the  variables  that  enters 
the  analysis  is  the  cutting  speed,  this  procedure  per¬ 
mits  the  incorporation  oT  the  effects  of  experimental 
cutting  speed  into  the  simulations. 

The  MC-SD  calculations  are  conducted  by 
repeated  application  of  the  following  steps: 

(a)  translation  of  the  tool  in  the  cutting  direction; 

(b)  calculation  of  the  temperature  rise  using  the 
thermal  model  of  Komanduri  and  Hou  113]  in 
the  chip*  the  shear  zone,  and  beneath  the  tool: 

tc]  calculation  of  M  SD  cycles  to  move  the  atoms  of 
the  system  dose  to  the 

configuration  corresponding  to  the  nearest  local 
minimum; 

(d)  execution  of  a  Markov  walk  of  Sv  and  S\.  steps  for 
atoms  in  the  workpiece  and  in  the  chip,  respec¬ 
tively,  using  the  temperature  distribution  calcu¬ 
lated  in  step  (b). 

Steps  (a)-(d)  are  repeated  until  the  desired  cutting  is 
achieved* 


5  APPLICATION  AND  RESULTS 

The  MC-SD  procedure  was  tested  against  pure  MC 
simulations  using  pairwise  potentials  to  model  the 
cutting  process  along  the  (1001  plane  of  a  single- 
crystal  aluminium  workpiece  with  an  infinitely  hard 
tool*  The  potential  parameters  are  given  in  Table  l. 


Table  1  Morse  parameters  used  in  the  MC-SD 
and  MC  simulations  of  nano  me  trie 
cutting  of  single -crystal  aluminium 


Parameter 

Value 

ttwi- 

0,27030  eV 

,irt 

1*154(50  A  1 

r? 

3.253  A 

0,087  <?V 

f»* 

5*14  A  1 

r? 

2*05  A 

Table  2  MC  and  MC-SD  parameters 
employed  in  the  calculations 


Pa  rammer 

Value 

A.v 

0.1  A 

Ar 

0.1  A 

0.1  A 

n 

0.01  k 

M 

4 

(Am  )mi 

40 

400 

Mmc 

1 

20 

(■SeJMrsu 

10O 

1 

The  MC  and  SD  parameters  employed  in  the  study 
are  given  in  Table  2* 

The  MC-SD  method  was  first  tested  by  simply 
relaxing  a  (_S  x  40  x  i  5)  aluminium  workpiece  using 
MC  methods  alone  and  comparing  the  system  poten¬ 
tial  at  each  iteration  with  that  obtained  when  the 
relaxation  was  executed  using  the  MC-SD  procedure. 
The  comparison  is  shown  in  Pig.  3.  it  can  be  seen  that 
in  both  cases,  the  potential  initially  drops  rapidly  and 
then  fluctuates  about  a  common  mean  value  in  both 
methods*  At  any  given  point  in'  the  process,  the  per¬ 
cent  difference  between  the  potentials  computed  by 
the  two  methods  is  always  less  than  0.12  per  cent. 
Thus,  it  can  be  seen  that  the  same  relaxed  state  is 
achieved  using  both  MC  and  MC-SD  methods.  How¬ 
ever,  significantly  fewer  MC  moves  are  required  for 
the  MC-SD  calculations*  The  results  also  demonstrate 
that  SD  methods  attain  convergence  more  rapidly 
than  purely  MC  procedures. 

In  the  second  test,  cutting  simulations  were 
conducted  with  a  10  rake  angle  tool  at  a  cut  depth 
corresponding  to  three  layers  of  aluminium  atoms 
at  a  cutting  speed  of  5  m/s.  The  change  in  potential 
of  the  metastable  configuration  obtained  at  the  com¬ 
pletion  of  each  movement  of  the  tool  was  computed 
for  each  of  the  simulations.  The  total  cutting  distance 
was  80  A,  This  distance  was  covered  by  800  tool  move¬ 
ments,  each  covering  a  cutting  distance  of  0. 10  A.  The 
results  for  both  MC  and  MC-SD  simulations  are 
shown  in  Fig.  4.  As  can  be  seen*  there  is  virtually  no 
difference  in  the  potentials  of  the  computed  final 
metastable  configurations*  The  maximum  absolute 
percent  difference  is  about  0.18  per  cent.  Since  diis 
difference  is  less  than  the  statistical  error  present  in 
the  MC  method,  it  is  not  expected  to  exert  any 
effect  on  the  final  results.  Clearly,  the  two  methods 
give  equivalent  results* 

Figure  5  shows  the  computed  change  in  cutting 
force  during  the  cutting  simulations  used  to  obtain 
the  results  shown  in  Fig.  4*  The  lime  on  the  abscissa 
of  Fig,  5  is  computed  by  simply  dividing  the  cutting 
speed  by  the  total  distance  moved  by  the  tool.  It  is 
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Fig.  3  Potentials  during  relaxation  of  a  (5  x  411  x  1 5}  aluminium  workpiece  whose  potential  is  described 
by  equations  (2)  and  (3)  with  the  parameters  given  in  Table  L  The  two  curves  show  close 
agreement  of  the  results  obtained  using  MC  and  MOSD  methods 
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Fig.  4  Changes  in  the  potential  of  metastable  configurations  obtained  by  MC  and  MOSD  simulations 
of  aluminium  cutting  at  5m/s,  10  rake  angle,  at  a  cm  depth  corresponding  to  three  layers  of 
lattice  atoms 


clear  that,  except  Tor  statistical  difference  produced 
by  the  stochastic  nature  of  MC  calculations,  the  two 
results  are  essentially  identical. 

Figure  6  shows  snapshots  of  the  configuration  of 
the  system  at  three  different  stages  of  cutting  for  the 
two  simulations.  Again,  it  can  be  seen  that  the  two 
methods  are  in  excellent  agreement. 

Finally,  four  different  comparison  calculations  or 
the  two  met  hods  were  conducted  using  the  same 
cutting  parameters  and  potential  parameters  as 
those  employed  to  obtain  the  results  shown  in  Tigs 
3-6.  The  calculations  differ  only  in  the  total  number 


of  lattice  atoms  included  in  the  aluminium  work- 
piece.  The  criterion  Tor  the  selection  of  the  number 
of  atoms  in  the  simulation  is  size  convergence,  i.e. 
when  Lhe  results  become  insensitive  to  further 
increase  in  the  number  of  atoms  considered,  the 
model  is  adequate.  It  was  typically  found  in  the 
nanomeiric  cutting  studies  that  the  size  convergence 
is  obtained  with  a  few  thousand  atoms  (<  10000)  |1|. 

Figure  7  shows  a  comparison  of  the  computational 
times  required  by  both  MC  and  MC-SD  simulations. 
It  may  be  noted  that  selected  8  700  lattice  atoms 
have  been  selected,  which  is  typical  of  MD 
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Fig.  5  Computed  changes  in  cutting  force  during  the  MC  and  MC-SD  simulations  used  to  obtain  the 
results  shown  in  Fig,  4.  Time  is  computed  from  the  ratio  of  the  cutting  speed  and  total  distance 
moved  by  ihe  tool 


MC  Simulations  MC-SD  Simulations 

Hg.  6  MC  and  MC-SD  simulations  of  cutting  of  a  (5  x  4(1  x  15)  aluminium  workpiece  with  in  rake 
angle  at  a  cut  depth  corresponding  to  three  layers  of  lattice  atoms  with  a  cutting  speed  of 
SiTi/s,  The  system  potent  ia  I  is  given  by  equations  (2)  and  (3]  with  the  parameter  values  listed 
in  Table  1,  The  snapshots  on  the  left  are  those  obtained  from  the  MC  calculations.  Those  on 
the  right  are  snapshots  at  the  same  moment  of  time  determined  by  MC-SD  methods 
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Fig,  7  Computational  limes  for  MC  and  MC-SD  simulations  of  cutting  of  an  aluminium  workpiece  at  a 
cutting  speed  of  5  m/s  at  a  cut  depth  corresponding  to  three  layers  of  lattice  atoms  as  a  function 
of  the  size  of  the  simulated  workpiece.  The  system  potential  is  given  by  equations  (2)  and  [3)  with 
the  parameter  values  listed  in  Table  I 


simulations  of  nanometnc  cutting,  and  17200  as  an 
upper  limit.  For  a  system  containing  8700  la  it  ice 
atoms,  the  computational  time  ratio  for  MC  and 
MC-5D  calculations  is  about  2.7.  When  the  number 
of  lattice  atoms  increases  to  17  200,  this  ratio 
decreases  to  about  L9,  Consequently,  a  computa¬ 
tional  lime  advantage  between  a  factor  of  two  to 
three  using  the  MC-SD  method  can  be  achieved.  It 
should  be  recognized  that  these  ratios  represent 
lower  limits  to  the  possible  computational  advantage 
lhat  might  be  obtained  for  the  test  system  considered 
here.  No  effort  was  made  to  optimize  the  MC-SD 
method  by  adjustment  of  a,  M,  {£w)mc_su»  and 
(SJmc-so*  ^  such  an  optimization  is  done,  further 
savings  in  processing  time  are  anticipated. 


6  CONCLUSIONS 

1.  To  reduce  the  computational  time.  MC  simulations 
of  nanometnc  cutting  have  been  modified  to 
include  a  combination  of  SO  and  MC  procedures. 

2,  Simulations  involving  MC-SD  combinations  are 
found  (o  reduce  the  required  computational 
times  by  a  factor  of  two  to  three  over  those 
achieved  using  MC  methods  alone,  thus  offering 
computational  time  advantages  with  this 
method.  As  the  number  of  atoms  in  the  model 
decreases,  the  reduction  in  computational  time 
increases.  The  results  obtained  from  the  MC-SD 
calculations  are  found  to  be  almost  identical  to 
those  resulting  from  the  MC  simulations. 


3.  It  is  anticipated  that  optimization  of  the  MC-SD 
method  by  adjustment  of  nt  A/t  so*  and 

(.S;,)mc-sl>  may  provide  additional  computational 
time  advantages. 
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A  neural  network/lrajectory  approach  is  presented  for  the  development  of  accurate  potent ial-energy 
hypersurfaces  that  can  he  utilized  to  conduct  ah  initio  molecular  dynamics  (AIMD)  and  Monte 
Carlo  studies  of  gas- phase  chemical  reactions*  nanometric  cutting,  and  nanotribology,  and  of  a 
variety  of  mechanical  properties  of  importance  in  potential  microelectro  mechanical  systems 
applications.  The  method  is  sufficiently  robust  that  it  can  be  applied  to  a  wide  range  of  polyatomic 
systems.  The  overall  method  integrates  ab  initio  electronic  structure  calculations  with  importance 
sampling  techniques  that  permit  the  critical  regions  of  configuration  space  to  be  determined  The 
computed  ab  initio  energies  and  gradients  arc  then  accurately  interpolated  using  neural  networks 
(NN)  rather  than  arbitrary  parametrized  analytical  functional  forms,  moving  interpolation  or 
I  cast- squares  methods,  The  sampling  method  involves  a  tight  integration  of  molecular  dynamics 
calculations  with  neural  networks  that  employ  early  stopping  and  regularization  procedures  to 
improve  network  performance  and  test  for  convergence.  The  procedure  can  be  initiated  using  an 
empirical  potential  surface  or  direct  dynamics.  The  accuracy  and  interpolation  power  of  the  method 
has  been  tested  for  two  cases,  the  global  potential  surface  for  vinyl  bromide  undergoing 
uni  molecular  decomposition  via  four  different  reaction  channels  and  nanometric  cutting  of  silicon. 
The  results  show  that  the  sampling  methods  permit  the  important  regions  of  configuration  space  to 
be  easily  and  rapidly  identified,  that  convergence  of  the  NN  fit  to  the  ab  initio  electronic  structure 
database  can  be  easily  monitored,  and  that  the  interpolation  accuracy  of  the  NN  fits  is  excellent, 
even  for  systems  involving  five  atoms  or  more.  The  method  permits  a  substantial  computational 
speed  and  accuracy  advantage  over  existing  methods,  is  robust,  and  relatively  easy  to  implement. 
©  2005  American  Institute  of  Physics.  [DOl:  10, 1063/ LI  850458] 


I,  INTRODUCTION 

Molecular  dynamics  and  Monte  Carlo  simulations  are 
the  most  powerful  existing  methods  for  the  investigation  of 
dynamical  behavior  of  atomic  and  molecular  motions  of 
complex  systems.  To  date,  such  studies  have  been  used  to 
study  chemical  reaction  mechanisms,  energy  transfer  path¬ 
ways*  reaction  rates,  and  product  yields  in  a  wide  array  of 
polyatomic  systems.  In  addition,  molecular  dynamics  (MD) 
and  Monte  Carlo  (MC)  methods  have  been  successfully  ap¬ 
plied  to  the  investigation  of  gas- surface  reactions,  diffusion 
on  surfaces  and  in  the  hulk,  membrane  transport*  the  synthe¬ 
sis  of  diamond  using  chemical  vapor  deposition  techniques, 
nanometric  cutting,  scratching,  and  grinding,  as  well  as 
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uniaxial  tensile  properties.  The  structure  of  vapor  deposited 
rare  gas  matrices  has  been  studied  using  trajectories  proce¬ 
dures. 

Such  studies  are  becoming  increasingly  important  for  the 
investigation  of  m ic roe lectro mechanical  systems  (MEMS) 
devices  such  as  sensors*  actuators,  and  structural  members 
for  biological  medical,  and  consumer  products.1  While  me¬ 
chanical  properties  for  most  materials  at  hulk  or  macro  level 
arc  well  documented*  the  corresponding  properties  at  mi¬ 
crolevels  and  nanolevcls  are  not  easy  to  obtain.  In  addition* 
the  relationship  between  nanopropcrtics  and  microproperties 
is  not  well  known  in  many  cases.  In  order  to  design  and 
fabricate  MEMS  devices,  the  design  engineer  must  know  the 
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material  properties  at  these  levels.  Consequently*  atomic 
simulation  methods  are  becoming  a  necessity. 

With  the  advent  of  relatively  inexpensive*  powerful 
workstations*  molecular  dynamics  simulations  have  become 
routine.  Once  the  potential -energy  hypersurfacc  for  the  sys¬ 
tem  has  been  obtained,  the  MD  or  MC  compulations  are 
straightforward.  In  the  majority  of  cases,  the  computation 
lime  required  is  on  the  order  of  hours  to  a  few  days.  How¬ 
ever.  the  accuracy  of  these  simulations  depends  critically  on 
the  accuracy  of  the  potential  hypersurfacc  used.  In  this  paper, 
wc  present  an  integrated  approach  for  the  determination  of 
accurate  potentials  from  ah  in  ilia  quantum  mechanical  cal¬ 
culations  using  novelty  sampling  procedures  and  a  feedfor¬ 
ward  neural  network  approach.  The  method  is  ideally  suited 
for  addressing  systems  containing  five  atoms  or  more  wfhich 
exhibit  multiple  open  reaction  channels. 

II  PREVIOUS  APPROACHES  TO  THE  POTENTIAL 
ENERGY  SURFACE  PROBLEM 

The  major  problem  associated  with  MD/MC  investiga¬ 
tions  is  the  development  of  a  potential -energy  surface  whose 
topographical  features  are  sufficiently  close  to  those  of  the 
true,  but  unknown,  surface  that  the  results  of  the  calculations 
are  experimentally  meaningful.  Once  the  potential  surface  is 
chosen  or  computed,  all  the  results  from  any  quantum  me¬ 
chanical*  sem  Classical,  or  classical  scattering  or  equilibrium 
calculation  arc  determined.  The  only  purpose  of  the  dynam¬ 
ics  calculations  is  to  ascertain  what  these  results  arc.  There¬ 
fore,  the  most  critical  part  of  any  MD  or  MC  study  is  the 
development  of  the  potential -energy  hypersurfacc  and  asso¬ 
ciated  force  field.  Surprisingly*  this  is  often  the  portion  of  the 
investigation  to  which  the  least  effort  is  devoted.  This  situa¬ 
tion  arises  because  of  the  inherent  difficulty  associated  with 
this  part  of  the  overall  problem. 

A,  Empirical  and  semiempirical  potential  surfaces 

When  four  or  more  many-electron  atoms  arc  present, 
global  potentials  have  usually  been  obtained  using  empirical 
methods  which  rely  upon  ad  hoc  parametrized  functional 
forms  suggested  by  physical  and  chemical  considerations. 
The  parameters  contained  in  these  forms  arc  generally  fitted 
to  equilibrium  thermochemical,  spectroscopic,  and  structural 
data.  If  experimental  activation  energies  are  known,  the  bar¬ 
rier  heights  predicted  by  the  empirical  potential  are  adjusted 
to  these  values.  Such  methods  have  been  employed  hy  nu¬ 
merous  investigators  since  the  mid  1970s,  For  example, 
Sewell  and  Thompson2  used  this  method  to  study  C-C\  and 
C-H  bond  scission  reactions  in  activated  two-chloroethyl 
radicals.  Similar  calculations  have  been  reported  by  Marks* 
who  studied  -N=C-*“C=N  isomerization  in  HD2CNC. 
Rice  and  Thompson4  have  examined  the  uni  molecular  de¬ 
composition  reactions  of  CH^N02  to  yield  CH3+NOi  and 
CHjO  +  NO  by  writing  the  potential  surface  as  a  summation 
of  channel  potentials  in  which  each  term  represents  one  of 
the  asymptotic  limits  CHjNCL,  CH3  +  N02,  or  CHjO  +  NO. 
Alimi.  Apkarian,  and  Gerber5  have  used  empirical  methods 
to  obtain  the  potential  surface  for  their  studies  of  pressure 
effects  of  CL  dissociation  in  Xc  matrices.  Lend  ray  and 


FIG,  L  Snapshot  during  the  MD  simulation  of  nano  mi*  trie  cutting  of  silicon. 


Schaiz6/?  fitted  spectroscopic  data  to  study  energy  transfer  in 
CS2+X  (X=H2*CO,HCI,CS2,CH4)  and  5F6+X  (X 
=  He, Ar*Xe,CO,SF6)  systems,  respectively.  Budcnholzer. 
Chang,  and  Huang®  have  investigated  the  dissociation  and 
intramolecular  vibrational  relaxation  of  dimethyl  peroxide 
with  three  potential  surfaces  developed  with  Lhesc  tech¬ 
niques.  We  employed  empirical  potentials  to  conduct  exten¬ 
sive  calculations  on  the  gas-phase  and  matrix -isolated  reac¬ 
tion  dynamics  for  the  unimolecular  decomposition  of  1.2- 
difluoroe thane  and  the  hi  molecular  reaction  of  F2  with 
ethylene*9^15 

All  machining*  indentation,  uniaxial  tension,  and  tribo¬ 
logical  studies  to  date  have  relied  on  empirical  potential s*6-36 
such  as  pairwise  sums  of  Morse ' 8  or  Lennard- Jones1'  poten¬ 
tials  for  metals,  Baskcs  and  his  colleagues  have  developed  an 
embedded-atom  method  (EAM)*2  and  modified  embedded- 
atom  methods  (Refs.  33-35)  for  application  to  fee  and  bcc 
metals,  and  even  to  hep  metals.  For  covalently  bonded*  semi¬ 
conductor  materials*  such  as  Si*  Ge*  and  diamond*  many- 
body 

potentials.  such  as  Tersoffs  potential,36  the 
Bolding-Anderson51  potential,  and  the  Brenner50  potential 
have  been  developed.  These  potentials  provide  approximate 
descriptions  of  most  materials  at  or  near  their  equilibrium 
configuration.  However,  when  the  lattice  atoms  are  in  con¬ 
figurations  far  removed  from  equilibrium,  the  accuracy  of 
any  empirical  potential  can  be  expected  to  decrease  signifi¬ 
cantly. 

A  typical  example  can  be  seen  in  the  simulation  of  na¬ 
nometric  cutting  of  silicon.  In  this  case,  large  energy  input 
during  culling  and  extensive  plastic  deformation  ahead  of  the 
tool  often  produces  a  variety  of  amorphous  clusters  that  are 
far  from  equilibrium.  Examples  of  such  configurations  arc 
seen  in  Fig.  1  both  beneath  and  directly  in  front  of  the  cutting 
tool  as  well  as  in  the  chip.  Since  the  parameters  contained  in 
empirical  potentials  are  adjusted  using  primarily  equilibrium 
data,  it  is  very  unlikely  that  the  energies  and  gradients  for 
these  amorphous  structures  arc  accurately  represented  by  the 
potential  surface.  A  similar  situation  exists  in  other  plastic 
deformation  processes,  such  as  nanotension  and  nanoinden- 
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tat  ion.  It  also  occurs  in  many  chemical  reactions  that  sample 
regions  of  configuration  space  that  are  far  removed  from 
equilibrium. 

The  limitations  and  liabilities  of  the  use  of  empirical 
potential  surfaces  in  MD  calculations  may  be  summarized  as 
follows. 

(1)  Since  the  ad  hoc  empirical  functional  form  is  simply 
concocted  with  little  or  no  theoretical  foundation,  it  is  highly 
unlikely  that  it  will  accurately  predict  the  correct  experimen¬ 
tal  force  field. 

(2)  The  parameters  contained  in  an  empirical  potential 
are  generally  adjusted  to  fit  equilibrium  and  stationary -point 
data,  such  as  bond  lengths,  bond  angles,  vibrational  frequen¬ 
cies,  reactant  and  product  equilibrium  energies,  elastic 
moduli,  sublimation  energies,  and  potential  barrier  heights. 
As  a  result,  the  potential  usually  fits  the  stationary  points  on 
the  surface  acceptably  well.  However,  MD  and  MC  simula¬ 
tions  are  primarily  concerned  with  the  energies  and  forces  at 
nonstationary  points.  It  is  very  unlikely  that  these  quantities 
are  accurately  represented  in  configurations  far  from  cquilib- 
rium- 

(3)  If  the  database  contains  data  from  ah  initio  quantum 
mechanical  calculations  at  nonstationary  points  as  well  as 
experimental  data,  the  problem  of  fitting  the  empirical  pa¬ 
rameters  becomes  very  difficult.  Such  fitting  must  be  accom¬ 
plished  using  iterative  adjustment  methods  that  often  require 
months  of  effort.  For  example,  nine  months  of  laborious  it¬ 
erative  fitting  was  required  to  obtain  an  acceptable  fit  of  the 
100  parameters  contained  in  the  empirical  function  used  to  fit 
the  global  potential  surface  for  vinyl  bromide 

(4)  Once  an  empirical  potential  is  developed  for  a  par¬ 
ticular  complex  system,  there  is  no  straightforward  means  by 
which  h  can  be  improved.  Any  alteration  of  the  potential 
requires  that  the  entire  fitting  process  be  repeated. 

(5)  Empirical  potentials  are  specific  for  the  system  under 
consideration.  The  procedures  used  to  obtain  such  potentials 
cannot  be  easily  automated  and  the  methods  applied  to  all 
systems.  Every  system  must  be  treated  individually.  There¬ 
fore,  the  research  is  tedious  and  labor  intensive. 

Each  of  the  above  problems  is  essentially  intractable. 
Because  of  this,  the  accuracy  of  molecular  dynamics  simu¬ 
lations  on  complex  systems  has,  for  practical  purposes, 
reached  its  limit.  If  we  wish  to  advance  beyond  this  natural 
limit  for  empirical  and  hybrid  potentials,  we  must  develop 
ways  to  effectively  utilize  quantum  mechanical  methods  that 
have  the  power  to  produce  more  accurate  surfaces  and  force 
fields, 

B,  Ab  initio  potential -energy  surfaces 

In  principle,  ah  initio  methods  can  produce  much  more 
accurate  potential  surfaces  for  molecular  dynamics.  The  fun¬ 
damental  problem  is  thai  electronic  structure  calculations  be¬ 
come  computationally  intractable  when  the  number  of  atoms 
and  electrons  present  in  the  system  becomes  large.  Neverthe¬ 
less,  excellent  results  have  been  obtained  for  numerous 
three-  and  four-body  systems.  In  this  section,  wc  present  a 
brief  review  of  some  of  systems  studied  and  the  methods 
employed  to  obtained  the  required  ah  initio  potentials. 
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In  the  case  of  simple,  three-body  reactions,  such  as  H 
+  D2— *HD+D  or  H2+F— t-HF+H,  accurate  ah  initio 
potential -energy  surfaces  can  be  obtained  with  a  minimum  of 
difficulty,38  The  total  configuration  space  is  sufficiently  small 
that  electronic  structure  calculations  of  the  energies  and  gra¬ 
dients  can  be  carried  out  in  all  regions.  Since  only  three 
internal  coordinates  arc  required,  interpolation  between  the 
computed  points  presents  no  formidable  obstacles.  For  ex¬ 
ample.  this  approach  has  been  employed  by  Bodo  et  al  A<]  to 
obtain  potential  surfaces  for  the  reactions 

LiH{X  'S+)  +  H(-5)  -  Li(2S)  +  H2(X  'S+) 
and 

LiH*(X:5+)  +  H(Jj)  -  Li(’s)  +  H2(X  'S+). 

The  ab  initio  energies  were  computed  for  H  341  different 
geometries  using  configuration  interaction  methods.  These 
data  were  then  fitted  using  Aguado- Paniagua  (AG)  type 
functions  and  a  simple  multipolar  expansion  at  long  range 
with  switching  functions  to  smoothly  join  the  two  forms. 

When  the  system  of  interest  contains  four  atoms,  it  is 
usually  no  longer  possible  to  conduct  electronic  structure 
calculations  in  all  regions  of  configuration  space.  Conse¬ 
quently,  importance  sampling  procedures  must  be  employed 
to  identify  the  critical  regions  of  configuration  space.  In  ad¬ 
dition,  interpolation  between  the  computed  points  on  the  sur¬ 
face  becomes  increasingly  difficult. 

Ischtwan  and  Collins'10  have  developed  a  moving  inter¬ 
polation  technique  in  which  the  potential  energy  in  the 
neighborhood  of  any  point  is  approximated  by  a  Taylor  series 
using  inverse  bond  length  coordinates  as  the  expansion  vari¬ 
ables.  Initially,  a  set  of  system  configurations  along  the  reac¬ 
tion  path  is  selected  using  chemical  intuition.  Ab  initio  elec¬ 
tronic  structure  calculations  of  the  system  energy  and 
gradients  in  these  configurations  are  then  executed.  The  data 
thus  produced  are  employed  to  obtain  the  set  of  initial  Taylor 
series  expansions.  A  Shepard  method4  is  used  to  effect  the 
required  interpolations.  This  procedure  expresses  the  poten¬ 
tial  at  any  configuration  as  a  weighted  average  of  the  Taylor 
series  about  all  N  points  in  the  data  set,  or.  alternatively, 
those  points  within  a  specified  cutoff  radius.  Subsequently, 
the  results  are  iteratively  improved  by  computing  trajectories 
on  the  Taylor  series  fitted  surface  and  recording  the  internal 
coordinates  at  a  series  of  successive  configurations  encoun¬ 
tered  in  the  trajectories.  These  new  points  are  added  to  the 
data  set  according  to  a  weight  factor  that  is  determined  by 
the  relative  density  of  points  in  the  data  set  in  the  region  of 
the  new  points.  The  weight  is  large  if  the  configuration  is 
close  to  a  large  number  of  trajectory  points  which  arc  them¬ 
selves  not  close  to  the  current  points  in  the  data  set.  Ischtwan 
and  Collins40  applied  the  method  to  the  NH  +  H2^NH2+H 
reaction.  Comparison  of  the  fitted  surface  to  the  exact  ah 
initio  energies  at  a  set  of  new  points  showed  that  the  average 
ahsolute  error  in  the  potential  after  three  iterations  was 
12.13  kJmor1.  The  average  relative  errors  in  the  surface 
gradient  at  this  point  was  56%, 

Jordan,  Thompson,  and  Collins43  used  the  moving 
i  me  rpol  a  lion-trajectory  sampling  method  to  investigate  the 
dynamics  of  the  OH  +  H2— vH2G+H  reaction.  The  criterion 
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used  to  assess  convergence  of  the  iteratively  improved  po¬ 
tential  10  the  final  surface  was  s<imc  computed  dynamic  vari¬ 
able  such  as  the  distribution  of  product  angular  momentum, 
distribution  of  product  relative  translational  energy,  or  reac* 
lion  probability. 

Numerous  four-atom  reactions  have  been  investigated 
using  the  moving  interpolation -trajectory  method.  In  some  of 
these  studies,  efficient  microcanomcal  sampling44'45  and 
Monte  Carlo  methods  arc  employed  to  replace  the  arbitrary 
choice  of  initial  configurations  along  the  reaction  coordinate. 
Collins  and  Bettens  have  studied  hydrogen-atom 
abstraction46  in  BeH2,  BeH2+H— *  BeH+H2,  and  the  reac¬ 
tion  of  atomic  nitrogen  with  Hj,47  M{4$)  +  H^Uj)  —  NHt 
+  IL  using  these  methods.  The  OH  +  H2  — t*H2G+H  reaction 
has  been  investigated  by  Thompson  and  Collins,48  Bettens, 
Hansen,  and  Collins49  have  examined  the  reaction  H4+0 
— +H204+H  and  Bettens  and  Collins50  have  reported  results 
for  the  C(V)+Hj  system. 

The  moving  interpolation-trajectory  method  with  Shep¬ 
ard  interpolation  has  also  been  employed  in  studies  of  quan¬ 
tum  reaction  dynamics.  Collins  and  Zhang51  reported  such  a 
study  on  the  reaction  BcH  +  H2— » BeH2+H.  Nguyen,  Rossi, 
and  Tmhlar52  used  a  dual  level  approach  that  combined  a 
small  number  of  high-level  electronic  structure  calculations 
with  a  lower-level  neglect -of- diatomic-differential -overlap 
(NDDO)  global  surface  obtained  with  a  modified  Shepard 
interpolation41 42  in  an  effort  to  determine  the  extent  to  which 
useful  dynamics  information  can  be  obtained  with  a  mini¬ 
mum  of  effort  devoted  to  electronic  structure  calculations. 
Bettens  and  Collins53  have  reported  a  study  of  the  accuracy 
of  the  potential  surfaces  obtained  using  this  method  for  nine 
four-body  systems  that  include  BcH*,  CHj,  NH3,  OH3, 
NeCOH4,  ArCOH\  and  KrCOH*.  For  a  review  of  these 
methods,  sec  Ref.  54, 

Other  methods  for  obtaining  ab  initio  potential  surfaces 
for  three-  and  four-body  systems  have  been  suggested.  One 
of  these  employs  the  criterion  of  reproducing  kernel  Hilbert 
space  (RKHS)  as  the  means  10  effect  the  required  interpola¬ 
tion  between  the  computed  ah  initio  energies.  Ho  el  al 
have  employed  this  procedure  to  investigate  the  0(‘D)  +  H2 
— ►OH  +  H  reaction.  When  1280  ah  initio  data  points  were 
employed  to  define  the  surface,  the  interpolation  yielded  a 
surface  whose  absolute  rrns  error  compared  to  ab  initio  en¬ 
ergies  was  1.26  kimo!-1.  Pederson  et  tf/.56  have  used  the 
same  method  to  study  the  dynamics  of  the  N(2D)  +  H2  reac¬ 
tion, 

Maisuradze  et  al.57  have  introduced  an  interpolating 
moving  least  squares  (IMLS)  method  to  effect  the  interpola¬ 
tion  between  the  computed  ah  initio  points.  When  the 
method  is  unrestricted,  the  least-squares  coefficients  are  ob¬ 
tained  from  the  solution  of  a  large  matrix  equation  that  must 
be  solved  repeatedly  during  a  trajectory  study.  In  this  form, 
the  computational  time  required  increases  with  NM2,  where 
N  is  the  number  of  data  points  10  he  fitted  and  M  is  the 
number  of  bases  functions  used  in  the  linear  combination 
that  provides  the  fit.  The  method  was  initially  tested  for  a 
one-dimensional  Morse  oscillator  and  a  one-dimensional 
slice  of  the  HN 2  surface.57  In  later  work,  Guo  et  a/.58  evalu¬ 
ated  the  IMLS  method  using  the  analytic  surface  of  Kuhn  et 
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TABLE  I,  Comparison  of  interpolation  accuracy  of  ab  initio  energy  Oh¬ 
io  ined  using  various  methods  for  three-  and  four- body  systems. 


nos  or  absolute  average 


System 

N“ 

Method 

energy  cnor  (kJ  mot’1) 

Reference 

oh3 

1,280 

RKHS 

1.26 

55 

Hy 

550 

MSI1, 

0  154 

61 

c+h; 

406 

MSI1’ 

0.6 

50 

h;+o 

I.D0O 

MSI” 

0,5 

49 

BeH+H3 

1.300 

MSI” 

0, 1  %-0.4% 

51 

OH  +  H2 

31 

MSI1" 

34.2 

52 

BeH2  +  H 

438 

MSI" 

0.35 

46 

QH  +  Hj 

400 

MSI” 

1,0 

48 

NH  +  Hj 

600 

MSI" 

12.1 

40 

N  +  H3 

699 

MSI* 

0.05 

47 

BeHj 

500-1000 

MS!* 

0,03-2.39 

53 

CH] 

500-1000 

MSI* 

0.03-2.39 

53 

nh; 

500-1000 

MSI* 

0.03-2.39 

53 

oh; 

500-1000 

MSI* 

0,03-2,39 

53 

OHj 

500-1000 

MSI* 

0.03-2.39 

53 

NeCOH* 

500-1000 

MSI* 

0.03-2.39 

53 

AiCOH* 

500-1000 

MSI* 

0.03-2.39 

53 

KiCOH* 

500-1000 

MSI* 

0,03-2,39 

53 

HOOH 

6489 

IMIS' 

4.52 

60 

HOOH 

300 

IMLS' 

8,24-15,6 

58 

HOOH 

6489 

MSI* 

1.95 

60 

*N  is  the  number  of  ah  initio  points  computed. 
hMSl  is  the  modified  Shepard  interpolation  procedure, 
stMLS  is  the  interpolating  moving  least  squares  method. 


at59  for  simple  0-0  bond  rupture  of  HQQH,  With  jV=  300. 
the  results  showed  the  interpolation  errors  to  lie  in  the  range 
8,2-16.5  kJ  mol"1.  Kawano  et  a/,60  have  compared  the  accu¬ 
racy  of  the  IMLS  method  with  that  of  the  modified  Shepard 
procedure  for  the  HOOH  bond  scission  reaction.  With  N 
-6489,  the  rms  errors  for  IMLS  and  modified  Shepard  were 
4.52  and  L95  ki  mol-1,  respectively. 

Table  1  provides  a  convenient  comparison  of  the  interpo¬ 
lation  accuracy  obtained  with  the  various  methods  that  have 
been  employed  for  three-  and  four-body  systems. 

When  the  number  of  atoms  in  the  system  becomes  five 
or  more,  the  difficulty  of  the  problem  increases  significantly. 
The  dimensionality  of  the  system  increases,  and  as  a  result, 
the  size  of  the  configuration  space  over  which  interpolation 
must  be  executed  is  much  larger.  If  more  than  one  reaction 
channel  is  open,  these  difficulties  become  even  greater 
Thompson.  Jordan,  and  Collins62  have  investigated 
hydrogen-atom  abstraction  in  the  CH4+H  system.  When  the 
configuration  sampling  is  done  by  initiating  trajectories  in 
the  H+CH4  region  of  configuration  space,  the  interpolation 
errors  produced  by  the  Shepard  moving  method  arc  about 
1 ,6  kJ  mol-1.  This  increases  to  3,5  kJ  mol'1  if  the  trajectories 
are  initiated  in  the  H2+CH3  region  of  the  surface.  The  asso¬ 
ciated  errors  in  the  relative  gradients  are  reported  to  be  about 
8%  and  12%-13%,  respectively. 

Similar  investigations  reported  by  Collins  and  Radom61 
for  the  exchange  reaction  of  HF  with  HOC4  gave  interpola¬ 
tion  errors  of  1.5  kJ  mol-1  when  HKX)  ab  initio  points  were 
included  in  the  database.  When  the  size  of  the  system  was 
increased  to  six  atoms  for  study  of  the  exchange  reaction 
H2G+HOC4.  the  rrns  error  increased  to  3.5  kJ  mol-1.  When 
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TABLE  II  Interpolation  accuracy  of  ah  mil  fa  energies  obtained  using  the 
modified  Shepard  interpolation  (MSI)  for  five-  and  six -body  systems. 


System 

AT 

rms  or  absolute  average 
energy  error  0d  mol"*) 

Relative  % 
error  in  gradient 

Re  Terence 

H  +  CHj 

1250 

L6 

(reactant  valley  sampling) 

8% 

62 

H  +  CH, 

1 250 

35 

(product  valley  sampling) 

I2%“1 3% 

62 

HF+HOC* 

1357 

1.5 

Not  reported 

63 

h3o+hqc* 

3047 

3*5 

Not  reported 

63 

h;+hd 

728 

0.17 

Not  reported 

64 

*N  is  the  number  of  ab  hut  fa  points  computed. 


a  modified  Shepard  interpolation  was  employed  to  study  the 
reactions 

h;  +  hd->h2d+  +  h2 

and 

Hj+D2— iHDj  +  HD, 

[he  average  absolute  interpolation  error  was  0.34  kJ  mot-1, 
Crepos  et  a/.65  have  employed  the  Shepard  interpolation 
method  to  study  surface  reactions  of  H2  on  Pt( HI).  Table  II 
summarizes  the  results  for  five-  and  six -atom  systems. 

In  order  to  avoid  the  extremely  difficult  task  of  interpo¬ 
lating  ab  initio  electronic  structure  data,  some  investigators 
have  employed  a  method  usually  called  “direct  dynamics/* 
In  this  approach,  trajectories  are  computed  by  direct  calcula¬ 
tion  of  the  force  field  at  each  integration  point  using  some  ab 
initio  quantum  mechanical  method.  Because  of  the  huge 
number  of  computations  required  for  each  trajectory,  the  ab 
initio  method  chosen  must  usually  be  some  form  of  density 
functional  theory,  For  example,  it  may  be  necessary  to  follow 
the  dynamics  for  several  picoseconds.  With  an  integration 
step  size  on  the  order  of  a  femtosecond.  1000  or  more  inte¬ 
gration  steps  will  be  required.  If  the  forces  arc  computed  at 
four  points  in  each  step,  something  on  the  order  of  4000 
gradients  will  have  to  be  computed  for  each  trajectory.  As  a 
result,  only  a  very  limited  number  of  trajectories  can  be  ob¬ 
tained.  As  the  system  under  investigation  increases  in  size, 
the  computational  requirements  will  quickly  overwhelm  the 
available  computational  resources,  The  basic  problem  is  that 
all  the  information  obtained  about  the  potential-energy  sur¬ 
face  and  i  he  corresponding  force  field  during  the  integration 
of  the  trajectory  is  discarded  after  the  completion  of  the  cal¬ 
culation.  Therefore,  each  subsequent  trajectory  fails  to  profit 
from  all  the  computational  effort  expended  in  obtaining  the 
previous  trajectories. 

The  conventional  wisdom  is  that  ab  initio  molecular  dy¬ 
namics  calculations  for  complex  systems  containing  five  or 
more  atoms  with  several  open  reaction  channels  are  presently 
beyond  our  computational  capabilities.  The  rationales  for  this 
view  are  (a)  the  inherent  difficulty  of  high-level  ab  initio 
quantum  calculations  on  complex  systems  that  may  make 
numerous,  large-scale  computations  impossible,  (b)  the  large 
dimensionality  of  the  configuration  space  for  such  systems 
that  makes  it  necessary  to  examine  prohibitively  large  num¬ 
bers  of  nuclear  conformations,  and  (e)  the  extreme  difficulty 
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associated  with  accurate  interpolation  of  numerical  data  ob¬ 
tained  from  electronic  structure  calculations  when  the  dimen¬ 
sionality  of  the  system  is  nine  or  greater.  In  this  paper,  we 
present  general  methods  to  handle  each  of  these  difficulties 
for  many  complex  systems.  Furthermore,  as  our  computa¬ 
tional  resources  increase,  the  range  of  systems  that  can  be  so 
treated  will  expand  without  the  need  to  significantly  modify 
the  methods.  Therefore,  the  theoretical  approach  is  robust 
and  powerful. 

Ill,  THE  NEURAL  NETWORK  APPROACH 

In  this  paper,  we  present  an  integrated  approach  for  ob¬ 
taining  ab  initio  quantum  mechanical  potential -energy  sur¬ 
faces  and  force  fields  for  use  in  molecular  dynamics  simula¬ 
tions  for  systems  of  five  or  more  atoms  in  which  several 
reaction  channels  may  be  energetically  open.  The  method 
involves  0)  electronic  structure  calculations  of  energies  and 
force  fields  for  an  ensemble  of  atoms  whose  interatomic  dis¬ 
tances  are  such  that  all  atoms  fall  within  a  previously  speci¬ 
fied  distance  from  some  central  point,  generally  called  the 
cutoff  radius,  (2)  the  implementation  of  sampling  methods 
that  utilize  MD  calculations  and  novelty-sampling  to  permit 
the  regions  of  configuration  space  that  arc  important  in  the 
dynamics  to  be  accurately  identified,  and  (3)  the  use  of  neu¬ 
ral  networks  (NN),  early  stopping,  and  regularization  meth¬ 
ods  to  provide  rapid  and  accurate  interpolation  of  the  ab 
initio  database  and  convenient  tests  for  convergence  that  do 
not  require  MD  calculation  of  dynamical  results. 

A.  Electronic  structure  calculations 

The  solution  to  this  portion  of  the  problem  is  already 
available  in  the  form  of  several  software  programs  that  will 
execute  different  levels  of  electronic  structure  calculations. 
The  GAUSSIAN  (Ref.  66)  suite  of  programs  is  an  example. 
With  such  software,  it  is  possible  to  tailor  the  computational 
accuracy  to  be  in  accord  with  the  complexity  of  the  system 
under  study  and  the  extent  of  computational  resources  avail¬ 
able  to  the  user.  Extremely  complex  systems  might  be  treated 
using  density  functional  theory  with  a  smaller  basis  set. 
Moller-Plesset  perturbation  theory  at  an  appropriate  level 
(MP2,  MP3,  MP4,  MP5)  can  be  used  with  larger  basis  sets 
for  systems  whose  complexity  is  not  as  great.  For  even 
smaller  systems,  configuration  interaction  or  multi- 
referenced  methods  can  be  employed.  As  our  computational 
resources  increase,  appropriate  adjustments  can  be  made  in 
the  type  of  electronic  structure  calculation  employed.  This 
flexibility  makes  the  overall  procedure  very  robust. 

The  calculations  reported  in  this  paper  utilize  the  Linux 
version  of  GAUSS JAN98  and  a  multiprocessor  containing  26 
CPU  nodes  each  with  a  clock  speed  of  1  7  GHz.  Gur  results 
show  that  with  only  one  node  in  the  CPU  cluster,  we  can 
compute  the  potential  and  associated  forces  for  one  configu¬ 
ration  of  a  five-atom  silicon  cluster  at  the  MP4(SDQ)  level 
of  accuracy  using  a  6-31G**  basis  set  in  about  15  min.  A 
seven-atom  silicon  cluster  containing  98  electrons  requires 
about  60  min.  Smaller  clusters  require  significantly  less  com¬ 
putational  time  since  the  time  scales  approximately  with  Af\ 
where  N  is  the  number  of  electrons  present.  A  MP4(SDQ) 
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calculation  of  the  potential  and  associated  force  field  for  one 
configuration  of  vinyl  bromide  (C2H3Br)  takes  about  6.5  min 
of  CPU  lime.  If  only  the  potential  is  computed,  the  required 
time  decreases  to  1.3  min.  These  calculations  employ  a  6- 
3!G(rf,p)  basis  set  for  the  carbon  and  hydrogen  atoms  and 
Huzinaga’s  (4333/433/4)  basis  set  augmented  with  split 
outer  j  and  p  (43321/4321/4)  orbitals  for  the  bromine  atom. 
A  polarization /  orbital  is  also  added  with  an  exponent  of  0.5 
to  provide  a  more  accurate  description.  Consequently,  we  arc 
able  to  compute  the  energies  and  forces  for  96  five-atom 
silicon  clusters  per  day  per  node.  With  26  CPU  units,  each 
with  its  own  dedicated  RAM  and  hard  drive,  the  energies  and 
forces  for  about  2500  dusters  can  be  computed  per  day.  For 
a  smaller  system,  such  as  vinyl  bromide,  about  5760  ah  initio 
energies  and  associated  force  fields  can  be  obtained  per  day. 
Thus,  the  electronic  structure  calculations  for  10s  configura¬ 
tions  can  he  completed  in  about  18  days  for  vinyl  bromide 
and  about  40  days  for  five-atom  silicon  clusters. 

As  the  clock  speeds  for  computers  increase  (machines 
operating  at  over  3,0  GHz  arc  already  available),  the  compu¬ 
tational  times  required  will  decrease  correspondingly.  A  52- 
node  cluster  with  CPU  clock  speeds  of  3.4  GHz  will  be  able 
to  complete  the  computation  of  the  vinyl  bromide  and  five- 
atom  silicon  cluster  potentials  and  force  fields  in  about  4.5 
and  10  days,  respectively.  Relative  to  the  time  generally  re¬ 
quired  for  the  execution  of  a  research  project,  such  numbers 
are  smalt. 

If  we  are  concerned  with  a  gas- phase  chemical  reaction, 
the  number  of  atoms  and  electrons  that  must  be  included  in 
the  electronic  structure  calculations  is  usually  equal  to  the 
total  number  in  the  system  of  interest.  For  machining  experi¬ 
ments,  the  situation  is  not  so  clear.  The  computational  power 
required  now  depends  upon  the  nature  of  the  clusters  that 
occur  in  a  machining  experiment  and  the  cutoff  radius  used 
in  the  calculations.  For  example,  consider  the  machining  of 
silicon.  If  we  are  using  a  Tersoff  potential^  with  a  3,0  A 
cutoff  radius,  MD  calculations  show  that  we  observe  clusters 
containing  from  two  to  seven  atoms  within  this  radius  but 
none  with  more  than  seven  atoms.  Therefore,  the  electronic 
structure  calculations  must  be  able  to  handle  seven-atom 
clusters  or  less,  if  we  are  machining  a  fee  metai  such  as 
aluminum,  the  cluster  sizes  are  much  larger.  Therefore,  un¬ 
less  the  available  computational  power  is  correspondingly 
increased,  it  will  he  necessary  to  substitute  something  like 
density  functional  calculations  for  the  MP4($DQ)  methods 
discussed  above, 

B.  Sampling  procedures 

Although  we  can  compute  MP4(SDQ)  energies  and 
forces  for  thousands  of  configurations  per  day  (and  about 
10 5  per  month)  for  systems  containing  between  50  and  100 
electrons,  this  number  is  still  very  small  compared  to  the 
totality  of  the  configuration  space  available  to  the  system. 
For  example,  consider  the  three-body  LiH2  system  whose 
configurations  are  determined  by  three  internal  coordinates. 
If  reasonable  ranges  are  taken  For  each  coordinate  and  each 
range  divided  into  N  equal  segments,  the  energies  and  force 
fields  for  /V1  configurations  would  need  to  he  computed  to 
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characterize  the  system.  Bodo,  Martinazzo,  and  Yianturco40 
used  11  341  configuration  points  for  this  purpose.  This  cor¬ 
responds  to  /V«* 23.  While  such  brute  force  calculations  can 
be  executed  for  a  three-atom  system  containing  only  five 
electrons,  larger  systems  rapidly  move  completely  out  of 
range.  The  configurations  of  the  50-electron,  six-atom  vinyl 
bromide  system  require  the  specification  of  12  internal  coor¬ 
dinates,  Thus,  a  complete  characterization  would  require 
about  23 12  or  2, 19  x  10l*  points.  If  we  have  the  computa¬ 
tional  power  to  compute  the  energies  For  28  000  configura¬ 
tions  per  day,  as  suggested  by  our  results,  it  will  still  require 
about  2.14  X  I  D9  years  to  obtain  the  energies  for  each  of 
these  configurations.  Obviously,  this  is  far  beyond  our  reach 

If  all  of  the  I0J6  configurations  of  vinyl  bromide  arc 
important  in  the  MD  calculations,  ah  initio  molecular  dy¬ 
namics  (AIMD)  calculations  are  indeed  beyond  our  present 
capabilities  and  probably  beyond  those  that  will  exist  for 
decades  or  centuries  to  come.  However*  simple  chemical 
logic  tells  us  that  most  of  the  10 16  configurations  arc  of  no 
interest  in  the  dynamical  studies.  For  example,  we  know  that 
configurations  such  as  H-H-H-C-Br-C  are  of  no  interest.  In 
any  system*  the  number  of  important  configurations  forms  a 
small  subset  of  the  total  configuration  space.  Therefore,  the 
key  to  success  is  to  devise  an  importance  sampling  method 
that  adequately  samples  this  subset  rather  than  the  entire  con¬ 
figuration  space. 

As  soon  as  we  direct  our  attention  to  the  sampling  prob¬ 
lem.  it  becomes  evident  that  molecular  dynamics  does  pre¬ 
cisely  what  we  require.  Such  methods  were  first  introduced 
by  Collins  and  co-workers. 54  The  nuclear  configurations  that 
are  generated  in  an  ensemble  of  trajectories  comprise  the 
subset  of  configuration  space  that  wc.wish  to  sample.  All  wc 
need  to  do  is  to  devise  efficient  procedures  for  (a)  the  incor¬ 
poration  of  configuration  points  into  the  database,  (b)  inter¬ 
polation  between  the  points,  and  (c)  testing  for  convergence 
of  the  potential  surface  that  are  well  suited  to  use  for  com¬ 
plex  systems  involving  multiple  reaction  channels.  The  data 
listed  in  Table  ft  suggest  that  the  accuracy  of  moving  inter¬ 
polation  methods  is  decreasing  rapidly  as  the  dimensionality 
of  the  system  increases.  This  is  particularly  the  case  for  the 
surface  gradients,  which  are  of  paramount  importance  in  dy¬ 
namics  simulations.  In  addition,  Monte  Carlo  methods  or 
methods  based  on  chemical  intuition  for  initiating  the  pro¬ 
cess  are  much  less  likely  to  work  well  in  complex  systems. 
We  also  need  convergence  tests  that  are  independent  of  the 
MD  results. 

The  method  we  employ  follows  that  used  by  Collins  and 
eo- workers. 54  Wc  sample  the  important  regions  of  configura¬ 
tion  space  in  iterative  fashion  using  trajectories.  However, 
the  present  procedure  initiates  the  process  by  employing  an 
empirical  potential  for  the  system  of  interest9-2940  ^  rather 
than  using  Monte  Carlo  methods*  microcanonical 
sampling.44'45  or  chemical  intuition.  This  is  a  much  more 
effective  starting  procedure  since  it  is  unlikely  that  random 
sampling  or  intuition  will  provide  an  initial  sampling  of  the 
important  regions  of  configuration  space  in  complex  systems 
with  many  open  reaction  channels. 

One  of  the  advantages  of  the  procedure  described  in  the 
current  work  is  that  the  iteration  ts  not  sensitive  to  the  choice 
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Modified  Tereoff  potential 

HG  2,  Energies  (eV)  computed  using  two  different  Tersoff-iypc  potentials 
for  a  set  of  several  thousand  Sis  configurations  generaied  using  MD  methods 
in  a  cutting  simulation  on  a  silicon  workpiece.  If  the  two  potentials  gave  the 
same  energies  for  a  given  configuration,  all  points  would  lie  along  the  45s 
line. 

of  the  empirical  surface  employed  in  the  initial  step.  So  long 
as  i he  choice  is  reasonable,  the  iterative  procedure  will  con¬ 
verge.  As  an  illustration,  we  have  employed  two  potentials 
proposed  by  Tersoff36  for  ihe  initial  step  in  the  investigation 
of  the  potential  surface  for  five -atom  silicon  clusters.  These 
two  potentials  have  different  functional  forms  and  different 
parameters.  Figure  2  shows  a  plot  of  the  energies  computed 
using  both  potentials  for  a  set  of  several  thousand  Si5  con¬ 
figurations  generated  using  MD  methods  on  a  silicon  work- 
piece  in  nanometric  cutting  similar  to  that  shown  in  Fig.  L  If 
the  two  potentials  gave  the  same  energies  for  a  given  con¬ 
figuration,  all  points  would  lie  along  a  45*  line.  Obviously, 
the  two  empirical  potentials  predict  very  different  energies. 
Nevertheless,  when  neural  nets  are  fitted  to  each  set  of  ah 
initio  energies  obtained  from  the  configurations  generated  by 
MD  calculations  using  the  two  empirical  potentials.  Fig.  3 
shows  that  the  predictions  of  the  two  nets  for  an  independent 
testing  set  are  in  excellent  accord. 

It  should  also  be  noted  that  the  need  to  have  a  reasonable 
empirical  potential  can  be  obviated  by  using  direct  dynamics 
as  the  starting  procedure.  In  this  case,  the  interative  proce- 


NN  output  bused  on  modified 
Tersoff  potcnliiL 


FIG.  3.  Predictions  of  two  different  neural  ncls  Titled  to  ah  mitut  energies 
for  Si*  configurations  generated  by  MD  calculation*  using  the  two  potentials 
whose  results  are  shown  in  Fig.  2,  The  plotted  points  are  for  an  independent 
testing  set  tf  the  two  NN  produce  the  same  interpolated  energies,  all  results 
wilUte  along  the  45*  line  AH  energies  are  in  cV. 
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dure  is  initiated  by  computing  several  trajectories  using  di¬ 
rect  dynamics  and  storing  the  system  configurations  at 
equally  spaced  intervals  of  lime*  The  energies  and  gradients 
at  these  points  obtained  in  the  direct  dynamics  calculations 
can  then  be  used  without  modification  to  begin  the  procedure 
or,  if  desired,  higher-level  electronic  structure  calculations 
can  be  executed  on  the  stored  configurations.  In  either  case, 
the  method  now  becomes  self-starting.  In  addition,  the  com¬ 
bination  of  direct  dynamics  with  neural  networks  provides  a 
procedure  that  makes  direct  dynamics  more  powerful  This 
combination  permits  future  trajectory  calculations  to  profit 
from  the  computational  effort  expended  in  the  direct  integra¬ 
tion  of  previous  trajectories. 

A  set  of  trajectories  using  this  empirical  potential  is  now 
computed  using  standard  MD  methods/*7  As  the  trajectories 
proceed,  we  store  the  nuclear  configurations  at  equally 
spaced  time  intervals.  This  is  continued  until  wc  have  several 
thousand  such  configurations.  The  energies  and  forces 
present  in  these  configurations  are  then  computed  using  an 
appropriately  chosen  ab  initio  method.  This  data  set  com¬ 
prises  our  approximation  for  the  ab  initio  system  potential 
and  force  field. 

In  the  second  step,  feedforward  NN  are  fitted  to  the  ab 
initio  database  for  the  energies  and  forces  obtained  in  the 
first  step.  The  fitting  procedure  is  described  in  Sec.  INC  A 
second  set  of  MD  trajectories  is  now  computed  using  the  NN 
force  field  rather  than  the  original  empirical  potential.  During 
these  calculations,  additional  nuclear  configurations  are 
stored  and  their  energies  and  forces  subsequently  computed. 
As  explained  below,  novelty  sampling  algorithms  based  on 
the  currenroensity  of  configuration  points  in  the  database 
guide  our  selection  of  configurations.  Regions  with  a  low 
density  are  preferentially  included  in  our  sample.  The  ener¬ 
gies  and  forces  obtained  for  the  newly  selected  configura¬ 
tions  are  added  to  the  overall  database  characterizing  the 
potential-energy  hypersurface  and  force  field.  New'  neural 
networks  are  trained  to  fit  to  this  expanded  database  and  the 
entire  procedure  repealed.  Since  the  configurations  obtained 
from  the  MD  calculations  involve  structures  in  which  all  of 
the  internal  coordinates  are  simultaneously  vary  ing,  the  com¬ 
puted  force  fields  will  properly  represent  all  the  coupling 
terms  that  influence  intramolecular  energy  transfer  and  reac¬ 
tion  dynamics. 

After  several  iterations,  wc  expect  the  ab  initio  potential 
and  force  field  obtained  from  the  trained  NN  to  converge  to 
something  close  to  the  experimental  potential  and  force  field, 
at  least  to  the  extent  that  our  choice  of  ab  imtio  computa¬ 
tional  method  is  capable  of  yielding  energies  and  forces  that 
approach  the  experimental  values.  As  described  below,  the 
novelty  sampling  itself  provides  a  convenient  test  for  conver¬ 
gence  that  is  independent  of  results  obtained  from  MD 
calculations. 

To  ensure  that  the  NN  accurately  represents  the  potential 
and  associated  force  field  for  the  system,  it  is  essential  that 
the  training  data  he  spread  throughout  the  subset  of  configu¬ 
ration  space  that  is  important  in  the  dynamics.  No  interpola¬ 
tion  method,  including  neural  networks,  can  he  expected  to 
extrapolate  accurately.  While  there  is  little  that  we  can  do 
during  training  to  improve  the  network  performance  outside 
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the  range  of  fhe  training  data,  we  can  improve  its  ability  to 
interpolate  between  data  points,  and  we  can  work  to  ensure 
that  the  training  data  cover  the  relevant  range  of  the  input 
space. 

To  guide  our  selection  of  new  configurations  to  be  added 
to  the  database,  we  employ  a  modification  of  the  method  of 
novelty  detection  developed  by  Fukriuayakamee  and 
Hagan. 68  Let  q,  be  a  [(3/V-6)  X  l  ]  column  vector  of  the  in¬ 
ternal  coordinates  of  the  uh  duster  in  the  data  set  currently 
being  used  to  determine  the  NN  force  field.  By  computing 
trajectories  using  this  force  field*  we  generate  a  configuration 
qn*  Qualitatively,  we  wish  to  incorporate  the  forces  and  en¬ 
ergies  of  this  configuration  into  ihe  database  if  q„  lies  in  a 
region  of  configuration  space  where  the  density  of  points  in 
the  database  is  low.  However,  if  this  region  already  contains 
a  high  density  of  configuration  points*  we  wish  the  probabil¬ 
ity  of  incorporating  point  qn  to  be  much  lower 

Novelty  detection  is  based  on  the  two-norm  difference 
between  vector  qn  and  vector  qk: 

H  -  q„ll = [(q*  -  qn)r(qi  -  ( ■ ) 

where  (qj-qn)r  is  the  transpose  of  (qi^qj  A  vector  d  is 
now  defined  whose  elements  are  all  the  distances  computed 
using  Eq.  (1)  for  each  of  the  N  configuration  points  in  the 
database.  The  minimum  distance  separating  point  qn  from 
other  points  in  the  database  is  the  minimum  element  of  d  that 
wc  denote  as  dm  =  rnin(d).  The  mean  separation  distance.  (rf)* 
is  the  average  value  of  the  elements  of  d,  The  basic  concept 
is  to  incorporate  configuration  point  qn  into  the  database 
with  high  probability  if  dm  or  (d)  is  large,  but  low  probability 
if  both  d,„  and  (d)  are  small. 

A  selection  algorithm  based  solely  upon  the  magnitude 
of  dm  will  often  result  in  the  failure  to  incorporate  configu¬ 
ration  points  that  are  needed  in  the  database  or  the  incorpo¬ 
ration  of  points  that  are  not  critical.  This  occurs  because  a 
minimum  distance  criterion  does  not  incorporate  consider¬ 
ation  of  the  gradient  of  the  function  being  fitted.  In  regions 
where  the  gradient  is  large,  many  more  points  will  he  re¬ 
quired  to  obtain  an  accurate  NN  fit  than  is  the  case  in  regions 
characterized  by  smaller  gradients.  Unfortunately,  the  under¬ 
lying  potential  function  is  unknown*  Therefore,  it  is  impos¬ 
sible  to  calculate  the  gradient  at  an  arbitrary  configuration 
point  without  executing  electronic  structure  calculations, 
Pu kriti ay akamcc  and  Hagan68  have  found  l hat  if  the  mini¬ 
mum  distance  between  a  set  of  modified  configuration  vec¬ 
tors  is  used  in  place  of  dmi .  the  effect  of  the  function  gradient 
can  be  incorporated  into  the  selection  algorithm.  Wc  create 
the  set  of  modified  vectors  F  using 

r,  =  lq,0,].  (2) 

where  Or  is  the  output  from  ihc  NN  at  configuration  point  qt. 
When  the  minimum  distance,  Vmt  between  T,  and  r„  for  all 
N  configuration  points  in  the  database  is  used  in  place  of  dm , 
the  effect  of  function  gradient  is  incorporated.  This  occurs 
because  if  ihc  gradient  between  points  qt  and  q„  is  large,  the 
corresponding  NN  outputs,  G,  and  Ofl,  will  he  very  different 
and  rm  will  increase,  as  desired. 

The  first  half  of  our  sampling  algorithm  is  based  on  the 
normalized  probability  distribution  function  of  Fm  values. 


FIG.  4  Hypothetical  distribution,  of  configuration  points  in  a  data  base  Tor  a 
system  wiih  iwo  configuration  variables,  and  qlr  The  points  labeled  l,  2, 
and  3  are  outliers. 


P(rjdVm.  for  all  N  configuration  points  in  the  current  data¬ 
base.  This  distribution  function  and  its  maximum  value, 
Pirjnm,  are  computed  at  the  beginning  of  each  iterative 
cycle  in  the  sampling  procedure.  As  new  configuration  points 
are  generated  during  the  trajectories  computed  using  the  cur¬ 
rent  NN  force  field,  they  are  added  to  the  database  if 

and 

r  r* 

1  m  1  m 


or  if 


(3) 


Otherwise  the  new  configuration  point  will  be  rejected  by  the 
first  test,  in  Eq.  (3),  Tx  is  an  empirically  determined,  accep¬ 
tance  threshold  in  the  range  (O^Ti  ^  l)  and  £  is  a  random 
number  whose  distribution  is  uniform  on  the  interval  [0*1  j. 
T*  is  a  small  value  of  Vm  chosen  so  as  to  exclude  points  that 
are  very  near  other  points  in  the  database.  Examination  of 
Eq.  (2)  shows  that  the  new  point  will  be  accepted  if  very  few 
points  in  the  database  have  fm  values  near  that  for  the  new 
point.  It  will  also  be  accepted  with  a  probability  that  de¬ 
creases  as  ^(F^)  approaches  its  maximum  value 

To  he  certain  that  our  sampling  does  not  exclude  con¬ 
figuration  points  that  arc  outliers,  we  must  augment  the  first 
half  of  the  sampling  algorithm  with  an  additional  test  that 
will  include  outliers.  An  outlier  configuration  point  is  usually 
defined  as  one  that  is  inconsistent  with  the  remainder  of  the 
database  and  one  that  does  not  follow  the  majority *s  under¬ 
lying  correlation.69 

Suppose  we  have  two  configuration  points  in  the  data¬ 
base  that  arc  far  removed  from  the  remainder  of  the  points. 
This  situation  is  illustrated  in  Fig.  4  for  the  simple  case  in 
which  the  configuration  depends  only  upon  two  coordinates, 
qx  and  The  points  denoted  as  1.2.  and  3  are  outliers.  Now 
suppose  one  of  the  iterative  cycles  generates  the  configura¬ 
tion  point  denoted  by  Point  3  in  Fig.  4.  Since  Fm  for  Point  3 
is  about  the  same  as  that  for  most  of  the  members  of  the 
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Inputs  Layerl  _ Layer2 


r - f  ^ 


a'  =  f ’(iw’p+b1)  a*»f!(LW*‘la'+b*) 

RG.  5.  Schematic  of  a  two- layer  neural  network. 

database,  the  first  selection  criterion  would  very  likely  result 
in  rejection  of  this  point.  However,  if  wc  arc  to  adequately 
sample  configuration  space,  it  is  clear  that  Point  3  needs  to 
be  included  in  the  database. 

We  can  ensure  that  an  outlier  such  as  Point  3  will  be 
incorporated  into  the  database  by  using  a  second  selection 
criterion.  At  the  beginning  of  each  iterative  cycle,  we  com¬ 
pute  the  normalized  probability  distribution  function  of  (f) 
values.  P((P)W<r>,  for  all  jV  configuration  points  in  the  cur¬ 
rent  database.  From  this  distribution,  wc  obtain  the  maxi¬ 
mum  value.  P«r»miB.  If  the  first  half  of  the  selection  algo¬ 
rithm  results  in  rejection  of  a  configuration  point,  we  still 
accept  the  point  if 

PUD)  =£  r2p«r» 

max 

or  if  (4) 


r«r» 

p«r))mi 


-7y. 


Proper  selection  of  7%  result  in  the  addition  of  most 
outlier  configurations  generated  in  the  MD  iterative  calcula¬ 
tions. 


C.  Feedforward  neural  network  fitting 

To  make  the  above  procedure  a  practical  method  for  MD 
calculations,  wc  must  avoid  compromising  the  accuracy  of 
the  ab  initio  database  by  concocting  ad  hoc ,  parametrized 
functional  forms  for  fitting  and  interpolation  of  the  potential 
and  force  field.  Not  only  would  such  a  procedure  signifi¬ 
cantly  degrade  the  accuracy  of  the  potential,  it  would  also  be 
so  tedious  to  execute  that  the  overall  method  would  become 
essentially  useless.70 

Our  procedure  is  to  employ  multilayer,  feedforward  neu¬ 
ral  networks  to  fit  the  potential  and  the  associated  force  field. 
These  networks  are  then  used  in  the  MD  calculations  to  pro¬ 
vide  the  forces  for  the  instantaneous  configuration  of  the  sys¬ 
tem.  A  detailed  discussion  of  such  networks  can  be  found  in 
Hagan  cl  and  Haykin.7" 

Figure  5  is  an  example  of  a  layered  feedforward  network 
(two  layers  in  this  case).  For  a  full  description  of  the  nota¬ 
tion,  see  Dcmuih  and  Beale.73  The  input  vector  to  the  net¬ 
work  is  represented  by  p1.  which  has  R  elements.  The  super¬ 
script  represents  the  input  number,  since  it  is  possible  to  have 
more  than  one  input  vector  (although  we  use  only  one  input 
vector  in  this  research).  In  our  case,  the  elements  arc  the 
intcrparticle  distances,  three -body  angles,  and  dihedral 
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angles  required  to  specify  the  configuration.  The  input  is 
connected  to  Layer  I  through  the  input  weight  IWU,  where 
the  first  superscript  denotes  the  layer  number  and  the  second 
superscript  represents  the  input  number  The  bias  for  the  first 
layer  is  represented  by  b[.  The  net  input  vector  to  Layer  1  is 
denoted  by  n1.  It  is  computed  as 

n*  =  JWup  +  b1 .  (5) 

The  output  of  Layer  I,  a3,  is  computed  by  passing  the  net 
input  through  a  transfer  function,  according  to  a’ss/^n1). 
The  output  vector  has  Sl  elements.  The  output  of  the  first 
layer  is  input  to  the  second  layer  through  the  layer  weight, 
LW2‘L  A  network  can  have  any  number  of  layers,  although 
two-  and  three-layer  networks  are  generally  used.  The  over¬ 
all  output  of  the  network  is  typically  chosen  to  be  the  output 
of  the  last  layer  in  the  network,  although  it  could  he  the 
output  of  any  layer.73 

For  our  investigations,  the  R  elements  of  the  input  vector 
p1  comprise  interatomic  distances,  bond  angles,  and  dihedral 
angles.  If  the  system  under  study  contains  N  atoms,  p1  will 
be  a  (3/V-6)  X  1  column  vector.  The  output  vector  of  the  mb 
layer,  a",  will  contain  only  a  single  element  if  the  NN  is 
trained  to  fit  the  potential  energy  of  the  system.  If  the  NN  is 
trained  to  fit  ihe  force  field,  an  will  be  a  (3N-6)  X  I  column 
vector.  If  we  employ  a  NN  trained  to  fit  both  the  force  field 
and  the  potential  energy,  a"  will  be  a  (3N-5)x  I  column 
vector 

The  power  of  the  multilayer  network  for  function  ap¬ 
prox  imatiun  has  been  well  studied.  Homik,  Slinchcombc. 
and  White74  have  shown  that  a  two-layer  network  with  sig¬ 
moid  functions  in  the  first  hidden  layer  and  linear  transfer 
functions  in  the  output  layer  are  universal  approximators. 
Pifikus75  gave  a  more  recent  review  of  the  approximation 
capabilities  of  neural  networks,  Niyogi  and  Girosi'*  devel¬ 
oped  bounds  on  function  approximation  error  when  the  net¬ 
work  is  trained  on  noisy  data.  In  this  research,  our  task  is  to 
choose  the  best  neural  network  architecture  to  predict  the 
required  potentials  and  force  fields  from  the  supplied  internal 
coordinates  of  the  configuration  and  then  to  train  the  appro¬ 
priate  network  to  find  the  optimal  set  of  weights  and  biases. 

NN  training  algorithms  arc  generally  gradient-based  op¬ 
timization  procedures  that  are  designed  to  minimize  the 
mean  square  error  between  the  outputs  of  the  network  and 
the  target  outputs  over  the  appropriately  chosen  training  set. 
That  is,  the  algorithm  is  provided  with  a  set  of  examples  of 
proper  network  behavior;  {(Pi '*i)*(P2' *2)* 
where  pv  is  an  input  vector  of  internal  coordinates  to  the 
network  and  is  the  corresponding  target  output  (e.g.,  po¬ 
tential  energy).  As  each  input  q^  is  applied  to  the  network, 
the  corresponding  network  output  is  compared  to  the  tar¬ 
get.  The  training  algorithm  adjusts  the  network  parameters  in 
order  to  minimize  the  sum-squared  error: 

0 

F(w)  =  2  -  a^)r{tw  -  a„) .  (6) 


where  w  is  a  vector  containing  all  network  weights  and  bi¬ 
ases. 
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Some  training  algorithms  are  incremental,  in  which  the 
weights  are  updated  after  each  randomly  selected  input  is 
presented  to  the  network.  The  extended  Kalman  filter 
algorithms77  are  examples.  Other  algorithms  have  a  batch 
form,  in  which  all  inputs  are  presented  to  the  network  before 
an  update  is  made,  as  in  the  Levenberg-Marquardl 
algorithm,78  The  results  reported  here  have  utilized  primarily 
the  Levenberg-Marquardt  procedure. 

Before  training  the  network,  we  must  select  the  number 
of  neurons  and  I  he  number  of  layers  necessary  to  achieve  an 
accurate  approximation.  We  also  need  to  determine  how  the 
training  data  set  should  be  selected.  The  key  in  selecting  the 
network  architecture  is  to  use  enough  neurons  to  capture  the 
complexity  of  the  underlying  function  without  having  the 
network  overfit  the  training  data,  in  which  case  it  will  not 
generalize  accurately  to  new  situations.  If  the  number  of  neu¬ 
rons  is  too  large,  the  network  will  overfil  the  training  data.  In 
this  case,  the  network  response  perfectly  matches  the  training 
data,  but  interpolation  between  the  training  points  is  poor 
because  the  network  has  too  many  independent  parameters 
that  have  not  been  constrained  in  any  way. 

Network  interpolation  can  be  improved  by  a  variety  of 
techniques.  In  one  method,  called  "early  stopping,'1  we 
place  a  portion  of  the  training  data  into  a  validation  data  set. 
The  performance  of  the  network  on  the  validation  set  is 
monitored  during  training.  During  the  early  siages  of  train¬ 
ing,  the  validation  error  will  decrease.  If  overfitting  begins, 
the  validation  error  will  begin  to  increase.  At  this  point, 
training  is  stopped.  Another  method  to  improve  the  interpo¬ 
lation  performance  of  the  network  is  called  ''regularization/' 
With  this  method,  the  performance  index  is  modified  to  in¬ 
clude  a  term  that  penalizes  network  complexity.  The  com¬ 
mon  penalty  term  is  the  sum  of  squares  of  the  network 
weights.  That  is,  we  replace  Eq,  (6)  with 

Q  n 

Fi  w)  =  2  %  “  “  V  +  pS  (w/)2,  (?) 

</=i  W 

This  performance  index  forces  the  weights  to  be  small, 
which  produces  a  smoother  (less  complex)  network  re¬ 
sponse.  The  key  to  this  method  is  choosing  the  best  regular¬ 
ization  parameter  p.  If  the  value  is  too  large,  the  network 
response  will  be  too  smooth  and  will  not  accurately  approxi¬ 
mate  the  underlying  function.  If  the  value  is  too  small,  the 
network  will  overfil.  There  are  several  procedures  for  select¬ 
ing  the  optimal  value  of  p,  One  of  the  most  successful  is 
Bayesian  regularization*0  1 

In  both  early  stopping  and  regularization,  we  gradually 
increase  the  number  oF  neurons  in  the  neural  network  until  it 
is  more  than  large  enough  to  provide  an  accurate  approxima¬ 
tion.  However,  wc  constrain  the  size  of  the  weights  and  the 
number  of  iterations  of  the  training  algorithm.  These  con¬ 
straints  reduce  the  effective  number  of  parameters  in  the  net¬ 
work  and  prevent  overfilling.  In  this  research,  we  use  a  com¬ 
bination  of  regularization  and  early  stopping  to  produce  the 
best  generalization  82 

A  very  important  consideration  in  network  performance 
is  to  have  enough  training  points  appropriately  spaced 
throughout  the  configuration  space  that  is  important  in  the 
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process  under  study.  Neural  networks  do  not  extrapolate  ac¬ 
curately,  Therefore,  we  must  have  adequate  training  data  in 
all  important  regions  of  configuration  space.  This  will  be 
accomplished  using  novelty  detection.68 

Since  feedforward  neural  networks  can  he  easily  differ¬ 
entiated  to  yield  the  gradients  of  the  function  being  fitted,  the 
method  provides  us  with  a  choice  of  three  procedures  to 
execute  MD  calculations.  We  can  use  a  NN  to  fit  the  com¬ 
puted  potentials  and  then  obtain  the  force  fields  required  by 
the  MD  calculations  by  differentiation  of  the  network  func¬ 
tion.  Alternatively,  we  can  use  a  separate  neural  network  to 
fit  the  force  fields  computed  by  the  ah  initio  calculations. 
This  network  can  then  be  used  in  the  MD  calculations  to 
produce  the  forces  directly.  A  third  option  is  train  one  net¬ 
work  to  fit  the  computed  potentials  and  at  the  same  time, 
have  the  derivatives  of  the  network  function  with  respect  to 
its  inputs  fit  the  force  fields.  The  performance  index  for  train¬ 
ing  this  network  would  be  a  combination  of  the  squared  error 
on  the  computed  potentials  and  the  squared  error  on  the  force 
fields.  To  implement  a  gradient-based  algorithm  on  the  com¬ 
bined  performance  index,  we  would  need  to  compute  the 
derivative  with  respect  to  the  network  weights  of  the  network 
function  derivative  with  respect  to  its  inputs.  This  requires 
back  propagation  operations  that  are  similar  to  those  required 
for  Hessian  calculations,83  If  the  NN  fits  are  exact,  the  three 
methods  are  equivalent.  However,  since  this  will  not  be  the 
case,  all  three  procedures  can  be  investigated  to  determine 
the  one  best  suited  to  the  MD  calculations. 

The  fact  that  NN  procedures  can  be  employed  to  inter¬ 
polate  the  computed  derivatives  of  the  potential  directly  pro¬ 
vides  a  significant  advantage  over  Shepard  interpolation 
methods.  To  interpolate  the  force  fields  directly  using  Taylor 
series  expansions  truncated  after  the  quadratic  terms  would 
require  the  evaluation  of  mixed  third  derivatives  of  the  po¬ 
tential  with  respect  to  the  expansion  variables.  These  deriva¬ 
tives  are  generally  not  available  from  electronic  structure  cal¬ 
culations. 

It  should  he  noted  that  the  interpolation  accuracy  of 
feedforward  NN  can  be  easily  increased  by  employing  more 
hidden  layers  and  additional  neurons.  This  is  another  major 
advantage  over  moving  interpolation  methods  using  Shep¬ 
ard's  procedure  and  moving  least -squares  methods.  To  in¬ 
crease  the  accuracy  of  the  Taylor  series  expansions  employed 
in  Shepard  interpolations,  additional  terms  must  be  included 
in  the  expansions.  These  terms  require  the  evaluation  of 
mixed  third,  fourth,  etc.  derivatives  of  the  potential  function, 
which  are  generally  not  available  from  electronic  structure 
calculations.40'47  To  increase  the  accuracy  of  the  moving 
least-squares  procedures,57  58  fl°  the  number  of  basis  functions 
M  must  be  increased  and  the  computational  requirements  of 
the  method  increase  with  M2. 

Neural  networks  have  been  used  successfully  by  Blank 
et  a/.8485  to  fit  data  derived  from  an  empirical  potential 
model  for  CO  chemisorbed  on  a  Ni(  III)  surface.  These  same 
investigators  also  examined  Ihc  interaction  potential  of  H2  on 
a  Si(100)-2x  l  surface  using  a  data  set  comprising  750  en¬ 
ergies  computed  from  local  density  functional  theory.  Hob¬ 
day,  Smith,  and  BcIBruno86  have  used  similar  methods  lo 
investigate  the  energies  of  OH  systems  with  a  NN  compris- 
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mg  five  input  nodes,  one  hidden  layer  with  six  nodes  and  an 
output  layer  The  computed  equilibrium  structures  were  al¬ 
most  identical  with  energies  within  0*08%-0,l4%  of  the 
Brenner10  energies,  Similar  success  with  NN  has  been  re¬ 
ported  by  Tafcii  et  a/.87  and  by  Gassner  et  at .88  for  torsional 
energies  in  large  biological  systems  and  for  the 
Ht0-A13+-H20  system*  respectively.  Brown,  Gibbs,  and 
Clary*'*  have  employed  a  NN  with  32  hidden  nodes  to  de¬ 
scribe  the  ah  initio  potential  surface  the  (HF)2  van  der  Waals 
complex.  The  error  in  the  complex  dissociation  energy  com¬ 
pared  to  the  ah  initio  results  was  0.048  kJ  mol"1. 

In  each  of  these  cases,  the  dimensionality  of  the  system 
was  six  or  less  and  only  one  reaction  channel  was  investi¬ 
gated.  These  factors  significantly  decrease  the  difficulty  of 
the  fining  problem.  The  results  of  the  present  study  show  that 
even  when  the  dimensionality  of  the  system  becomes  large 
and  many  reactions  channels  are  open,  neural  networks  can 
still  be  effectively  employed  to  represent  ab  initio  potential 
hypcrsurfaces  provided  they  arc  combined  with  appropriate 
sampling  and  initiation  methods. 

IV.  RESULTS 

We  have  tested  the  feasibility  of  our  proposed  methods 
for  gas- phase,  chemical  reaction  dynamics  and  machining 
applications  using  the  unimolccular*  gas-phase  dissociation 
of  vinyl  bromide  and  orthogonal  cutting  of  a  silicon  work- 
piece  as  our  prototypes.  Each  of  these  systems  tests  our  pro¬ 
posed  methods  with  its  own  distinct  set  of  computational 
difficulties.  The  vinyl  bromide  potential  is  a  12-dimensional 
surface.  Dissociation  at  an  internal  energy  of  6,44  eV  in¬ 
volves  four  distinct  dissociation  channels,  bromine  atom  dis¬ 
sociation,  hydrogen- carbon -atom  bond  scission,  three -center 
HBr  elimination  and  three-center  H2  elimination.  The  four- 
center  H2  and  HBr  dissociation  channels  have  previously 
heen  found  to  be  of  minor  importance/17  70  The  NN  fits  to  the 
ab  initio  energies  and  force  fields  must,  therefore,  represent 
each  of  the  channels  accurately  if  the  product  yields*  disso¬ 
ciation  rates,  and  vibrational  energy  distributions  for  the 
various  reaction  products  are  to  be  correctly  predicted,  Sili¬ 
con  machining  presents  a  very  different  set  of  problems.  The 
energies  and  force  fields  for  clusters  containing  up  to  seven 
silicon  atoms  with  98  electrons  must  be  accurately  repre¬ 
sented.  Moreover,  the  high-energy*  violent  nature  of  the  cut¬ 
ting  experiment  ensures  that  many  of  these  clusters  will  be  in 
configurations  that  arc  very  far  from  equilibrium.  Therefore, 
the  methods  must  be  able  to  handle  a  very  wide  range  of 
structures  with  sufficient  accuracy  to  permit  meaningful  MD 
and  MC  calculations  to  be  executed, 

A.  Vinyl  bromide  system 

To  execute  the  first  step  in  the  iterative  sampling  process 
described  in  Sec.  HI  B,  we  have  employed  the  empirical  hy¬ 
brid  potential  surface  previously  developed  by  Rahaman  and 
Raff.70  An  ensemble  comprising  nonreactive  trajectories  and 
ones  that  result  in  three-center  HBr  or  H3  dissociation* 
hydrogen-atom,  and  bromine -atom  dissociation  were  se¬ 
lected  for  the  initial  sampling.  Configuration  points  along 
these  trajectories  were  recorded  at  equally  spaced  time  inter- 
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FIG,  6,  Atom  label*  for  vinyl  bromide, 


vals  to  obtain  I4()0  points.  The  MP4(SDQ)  energies  and  as¬ 
sociated  force  fields  for  these  configurations  were  computed 
on  an  AMD  Athlon™  XP  21Q0+  machine  with  a  1.87  GHz 
clock  speed.  The  calculations  employed  a  6-3lG(d,p)  basis 
set  for  the  carbon  and  hydrogen  atoms  and  Huzinaga's 
(4333/433/4)  basis  set  augmented  with  split  outer  s  and  p 
(43321/4321/4)  orbitals  and  a  polarization  /  orbital  with  an 
exponent  of  0,5  for  the  bromine  atom.  The  atom  labels  used 
in  the  study  are  shown  in  Fig,  6, 

Prior  to  NN  fitting,  the  data  obtained  from  the 
MP4(SDQ)  calculations  were  scaled  to  lie  in  the  range 
[-I*  +  l]  using 

(8, 

^  max  /'min 

where  Ks  is  the  scaled  result  and  Kmin  and  are  the  mini¬ 
mum  and  maximum  values  of  the  potential  energy  or  the 
associated  derivatives  in  the  database.  After  dividing  the  daLa 
into  training,  validation,  and  testing  sets*  a  NN  with  12  in¬ 
puts*  20  hidden  neurons,  and  one  output  (12-20-1)  was  filled 
to  the  scaled  data  using  the  Levenberg-Marquardt 
algorithm.78  The  NN  employed  sigmoid  transfer  functions 
and  linear  output  functions.  The  scaled  input  data  are  much 
more  closely  approximated  by  the  NN  sigmoid  transfer  func¬ 
tions.  which  range  from  -  I  to  +1,  than  the  unsealed  values. 
Early  stopping79  was  used  to  avoid  overfitting.  The  fitting 
process  required  approximately  six  minutes  of  computational 
time.  This  is  in  sharp  contrast  to  the  months  of  effort  needed 
to  fit  complex,  multidimensional  empirical  surfaces.7” 


RG.  7.  Comparison  of  oh  tniiio  MP4(SDQ)  calculations  (plotted  poini*) 
with  the  predictions  of  the  NN  (smooth  curve)  for  the  C=C-  stretching 
potential  of  vinyl  bromide. 
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RG  S.  Comparison  of  ah  initio  MP4(SDQ)  calculations  (plotted  points) 
with  the  predictions  of  Ihe  NN  (smooth  curve)  for  the  C-Br  stretching 
potential  of  vinyl  bromkJe. 


FfC.  10.  Comparison  of  ah  initio  MP4(SDQ)  calculations  (plotted  points) 
with  the  predictions  of  the  NN  (smooth  curve)  for  the  C=C-Ht]l>  bending 
potential  of  vinyl  bromide. 


Figures  7  anti  8  show  the  accuracy  of  the  NN  fits  for 
C=C  and  Cf2,-Br,  bond  stretching,  respectively.  Figure  9 
shows  the  variation  of  the  potential  for  stretching 

motion.  The  plotted  points  in  the  figures  are  the  MP4(SDQ) 
results;  the  curve  is  the  prediction  from  the  neural  network. 
As  can  be  seem  ihe  agreement  is  excellent.  Figures  10  and  11 
show  the  results  for  variations  in  the  C— C-H*3*  and 
C==C— Br  angles,  respectively.  Figures  12  and  13  illustrate 
the  accuracy  of  the  NN  fitting  for  variations  of  the 
Ht4,-C=C-Br  and  Ht5)-C=C-HU!  dihedral  angles,  re¬ 
spectively.  In  each  case,  the  neural  network  provides  an  ex¬ 
cellent  hi  to  the  ab  initio  database. 

Figure  14  shows  a  comparison  of  ihe  MP4(SDQ)  scaled 
energies  and  the  corresponding  NN  results  for  the  test  set  of 
vinyl  bromide  configurations.  If  the  NN  fit  were  perfect,  al! 
points  would  fall  on  the  45a  line  shown  in  the  figure.  The 
computed  rms  deviation  of  the  NN  and  quantum  mechanical 
energies  is  0.092  kJ  mol"1.  Clearly,  the  fit  is  nearly  exact. 
Comparison  with  the  data  given  in  Table  It  for  the  six-atom 
FUG+HOC*  system  shows  the  rms  error  using  the  Shepard 
interpolation  method  was  38  times  greater  than  that  for  the 
vinyl  bromide  system,  which  has  four  open  reaction  channels 
as  opposed  to  two  for  the  H2G+HOC+  surface.  Figure  15 


shows  the  distribution  of  the  deviations  between  the  ab  initio 
and  NN  results.  The  points  give  the  computed  distribution. 
The  smooth  curve  is  a  least-squares  fit  of  the  Lorentzian 
function 


P(A£)  = 


a 

{AE-  0)2  +  tr2  * 


(9) 


where  AE  is  the  difference  between  the  scaled  ab  initio  and 
NN  potential  energies,  and  a,  ft,  and  u  are  fitting  parameters. 
The  fit  shown  in  Fig.  15  has  »=8.01X  10"*,  /?=!.43X  KT4. 
anti  fr=  1 .67  X  Hr4, 


B.  Sis  system 

We  have  also  obtained  an  ab  initio  potential -energy  hy- 
pcrsurface  for  five-atom  silicon  clusters  that  occur  in  the 
simulation  of  a  cutting  experiment  with  a  diamond  tool  and 
silicon  workpiece.  As  previously  noted,  the  very  high  ener¬ 
gies  present  in  a  cutting  experiment  significantly  increase  the 
size  of  the  subset  of  configuration  space  that  must  be 
sampled  to  obtain  an  accurate  potential -energy  hypersurfacc. 
In  effect,  there  are  multiple  "reaction  channels"  open.  There- 


HG.  0.  Comparison  of  ah  initio  MP4(SDQ)  calculations  (plotted  points) 
with  die  predict  ions  nf  die  NN  (smooth  curve)  for  the  stretching 

potential  of  vinyl  bromide. 


FIG.  H,  Comparison  of  ah  initio  MP4(SDQ)  calculations  (plotted  points) 
with  the  predictions  or  the  NN  (smooth  curve)  for  the  D=C-Br  bending 
potential  of  vinyl  bromide. 
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H(4)-C=C-Br  Dihedral  Angle/degrees 

FJGr  12.  Comparison  of  ah  initio  MP4(SDQ)  calculations  (plotted  points) 
with  the  predictions  of  the  NN  (smooth  curve)  for  variations  of  the 
H£*]-C=C-Br  dihedral  angle  of  vinyl  bromide 

fore,  this  system  is  an  extremely  demanding  lest  of  our  meth¬ 
ods. 

A  Tersoff  potential  was  employed  in  the  initial  step  of 
the  iterative  process,  MD  cutting  simulations  with  a  +15° 
rake  angle,  10°  clearance  angle,  l  A  cutting  depth.  54.3  A 
cutting  width,  at  a  cutting  speed  of  491.2  m  s"1  were  carried 
out.  The  total  number  of  atoms  in  the  workpiece  was  1675. 
Throughout  the  MD  simulations,  the  configurations  of  the 
five-atom  silicon  clusters  that  are  present  in  front  of  the  tool, 
in  the  chip,  and  within  a  few  unit  cell  distances  in  the  work- 
piece  beneath  the  tool  are  stored. 

Subsequent  to  the  MD  calculations,  the  energies  and 
forces  fields  for  each  of  the  18  000  stored  configurations  are 
computed  using  density  functional  theory  (DFT)  writh  a  6- 
3 1 G  *  *  basis  set  and  the  B 3 LYP  procedure  for  incorporating 
correlation  energy.66  In  executing  these  calculations,  it  is  im¬ 
portant  to  take  symmetry  into  account.  The  energy  and  force 
fields  for  any  given  Si5  cluster  arc  obviously  unaltered  if  two 
or  three  silicon  atoms  are  exchanged.  However,  the  neural 
network  does  not  know  this.  Therefore,  we  must  either  train 
the  NN  to  recognize  symmetry  by  including  all  permutations 
of  identical  atoms  in  the  database  or  adopt  a  procedure  that 
makes  this  unnecessary.  Since  there  are  5!  permutations  for 


M(5)-C=C-H(3)  Dihedral  Angle/degrees 

FIG  13.  Comparison  of  ah  initio  MP4(SDQ)  calculations  (plotted  points) 
with  the  predictions  of  (he  NN  (smooth  curve)  for  variations  of  the 
dihedral  angle  of  vinyl  bromide. 


RG.  14.  Comparison  of  scaled  [Eq.  (8)]  ah  initio  MP4(SDQ)  energies  with 
I  he  predictions  of  the  NN  for  a  test  set  of  aboul  140  configurations  If  die  fit 
were  perfect,  all  points  would  fall  on  the  4 5D  line  shown  in  the  figure  The 
standard  deviation  of  I  he  points  from  this  line  corresponds  to 
0.092  kl  moT*. 

the  Si5  system,  we  have  adopted  ihe  latier  approach.  The 
Z- matrix  input  describing  the  Si5  clusters  comprise  four 
Si-Si  bond  distances,  three  Si-Si-Si  angles,  and  two  dihe¬ 
dral  angles.  Before  each  calculation,  we  list  the  four  bond 
distances  in  increasing  order  of  iheir  deviation  from  the 
Si-Si  equilibrium  bond  length.  This  listing  determines  the 
Z-mairix  input  vector  for  the  bond  angles  and  dihedral 
angles.  When  the  neural  net  is  utilized  to  interpolate  energies 
or  force  fields,  the  same  ordering  of  the  input  vector  is  used. 
In  effect,  this  procedure  obviates  the  need  to  train  the  NN  to 
recognize  symmetry  and  simplifies  the  overall  training  pro¬ 
cess. 

The  database  obtained  from  the  cutting  simulations  was 
augmented  with  an  additional  10  000  five-atom  Si5  configu¬ 
rations  near  equilibrium.  These  configurations  were  obtained 
by  placing  thermal  energy  corresponding  to  a  temperature  of 
300  K  in  the  silicon  workpiece  and  then  following  the  vibra¬ 
tional  motions  of  the  lattice  using  molecular  dynamics  with 
the  Tersoff  empirical  potential  for  silicon.36  Five- atom  con¬ 
figurations  were  stored  at  equally  spaced  time  intervals  dur- 


FIG.  15.  Distribution  of  the  deviations  of  the  predictions  of  ihc  NN  from  the 
ah  initio  MP4(SDQ)  energies  for  vinyl  bromide.  A  Lola  I  of  140  resting  con¬ 
figurations  arc  included.  The  computed  results  arc  shown  as  plotted  points 
The  smooth  curve  is  a  least  squares  fit  of  I5q,  f9)  to  these  data. 
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ing  the  calculations.  The  energies  and  force  fields  for  these 
configurations  were  obtained  using  DPT  and  the  Z-matrix 
ordering  previously  described. 

In  the  second  phase  of  the  procedure,  the  ab  initio  po¬ 
tential  energies  for  each  of  the  28  000  configurations  are 
scaled  using  Eq.  (8).  The  scaled  database  is  then  divided  into 
training,  validation,  and  testing  sets.  Approximately  10%  of 
the  data  are  used  for  the  validation  set,  10%  for  the  testing 
set,  and  the  remainder  is  the  training  set,  A  (9-45-1)  NN  is 
fitted  to  the  data  in  the  training  set  using  the  Levenberg- 
Marquardt  algorithm.78  The  NN  employed  sigmoid  transfer 
functions  and  linear  output  functions. 

To  avoid  overfilling,  it  is  essential  to  employ  an  early 
stopping™  procedure.  In  this  method,  the  rms  deviation  of 
the  NN  output  from  the  data  in  the  validation  set  is  computed 
after  each  iterative  training  cycle  in  the  NN  fining  process. 
At  first,  this  rms  deviation  will  decrease  as  the  number  of 
training  cycles  increases  because  the  accuracy  of  the  fit  is 
improving.  At  some  point,  however,  overftuing  of  the  train¬ 
ing  set  will  begin.  While  overfilling  will  improve  the  appar¬ 
ent  accuracy  of  the  fit  to  the  training  data,  it  will  increase  the 
rms  deviation  of  the  NN  output  to  the  validation  data  since 
these  data  arc  not  included  in  the  training.  The  procedure, 
therefore,  is  to  continue  the  training  process  until  the  rms 
deviation  of  the  NN  output  from  the  validation  data  attains  a 
minimum  value.  The  training  process  is  stopped  at  that  point. 
We  have  also  incorporated  Bayesian  regularization8081  to  im¬ 
prove  network  performance. 

The  entire  training  process  for  Sis  with  a  database  of 
22  400  configurations  required  about  16  h  of  CPU  time.  We 
have  found  that  this  can  be  reduced  to  about  8  h  by  using  a 
conjugate  gradient  fitting  method71  initially  and  then  switch¬ 
ing  to  the  Levenberg-Marquardi  algorithm78  about  halfway 
through  the  fitting.  Comparison  of  the  networks  obtained  by 
the  two  procedures  showed  that  they  produce  nearly  the 
same  interpolated  energies  and  force  fields  for  Si5,  It  is  inv 
portam  to  note  that  these  fitting  processes  require  little  or  no 
human  intervention.  The  computational  effort  needed  is  at 
least  two  orders  of  magnitude  less  than  that  required  to  fit 
multidimensional  empirical  surfaces  containing  over  100  ad¬ 
justable  parameters.™  The  reduction  in  human  time  is  even 
greater. 

Figure  16  shows  a  comparison  of  the  DFT  scaled  ener¬ 
gies  and  the  corresponding  NN  results  for  the  testing  subset 
of  the  28  000  Si5  configurations.  If  the  NN  fit  were  perfect, 
all  points  would  fall  on  the  45*  line  shown  in  the  figure.  The 
computed  rms  deviation  of  the  NN  and  DFT  energies  is 
0.36  kJ  moF1.  This  level  of  accuracy  is  comparable  to  that 
achieved  using  Shepard  interpolation  methods  for  three-  and 
four-atom  systems  and  significantly  belter  than  the  corre¬ 
sponding  accuracy  reported  for  a  six -atom  system  with  two 
open  reaction  channels  using  Shepard  interpolation.^ 

The  next  step  in  the  procedure  is  to  utilize  the  NN  to 
execute  AtMD  calculations  during  which  we  employ  novelty 
sampling  techniques  to  obtain  additional  configurations  to 
iteratively  improve  the  potential -hypersurface  until  an  ac¬ 
ceptable  degree  of  convergence  to  the  final  potential  and 


FIG.  16.  Comparison  of  seated  [Eq,  (8)]  ab  initio  DFT  energies  with  the 
predictions  of  the  NN  for  the  testing  subset  of  28  OOQ  Si*  con  ft  gu  rations.  If 
the  fit  were  perfect,  all  points  would  fall  on  the  45°  line  shown  in  the  figure 
The  standard  deviation  of  the  points  from  (his  line  corresponds  to 
0,36  U  mof1. 

force  fields  is  obtained.  To  illustrate  the  method,  we  have 
examined  the  vibrational  motions  of  a  silicon  lattice  at 
800  K, 

Figure  17(a)  shows  the  distribution  of  minimum  dis¬ 
tances,  P{Tml  for  the  initial  set  of  configurations.  When  Eq, 
(3)  with  7|=0,5  and  F'„=0,06  is  applied  to  the  configurations 
resulting  from  trajectories  computed  using  the  NN  fit  to  the 
original  data  set,  several  thousand  additional  configurations 
are  identified  whose  distribution  of  rm  values  is  that  shown 
in  Fig.  17(b).  Comparison  of  this  distribution  with  Fig.  16  for 
the  original  configurations  shows  that  the  new  configurations 
lie  in  a  region  of  configuration  space  not  adequately  sampled 
by  the  original  data  set.  These  configurations  are,  therefore, 
incorporated  into  the  overall  database  and  a  new  NN  is  fitted 
to  the  computed  DFT  energies.  The  new  distribution  P{Vm) 
for  the  concatenated  set  of  configurations  is  shown  in  Fig. 
17{c),  During  the  second  iteration,  essentially  no  new  con¬ 
figurations  are  found  whose  Vm  values  do  not  lie  underneath 
the  distribution  shown  in  Fig.  17(c),  The  process  has,  there¬ 
fore,  converged  for  the  test  system.  The  advantage  of  this 
procedure  is  that  it  is  unnecessary  to  compute  dynamics  re¬ 
sults  in  order  to  test  for  convergence. 

V.  SUMMARY 

A  feedforward  NN- novelty  sampling  approach  for  the 
development  of  accurate  ah  initio  potential-energy  hypersur- 
faces  that  can  be  utilized  to  conduct  molecular  dynamics  and 
Monte  Carlo  studies  of  large  systems  with  multiple  open 
reaction  channels  has  been  described.  The  method  integrates 
ab  initio  electronic  structure  calculations  with  trajectory/ 
novelty-sampling  techniques  that  permit  the  critical  regions 
of  configuration  space  to  he  determined.  The  modified 
novelty-sampling  method  involves  the  tight  integration  of 
MD  calculations  with  neural  networks  that  employ  early 
stopping  and  regularization  procedures  to  improve  network 
performance  and  test  for  convergence. 

The  electronic  structure  calculations  utilize  existing  soft¬ 
ware  that  permit  the  accuracy  of  the  quantum  mechanical 
calculations  to  he  scaled  up  as  computational  resources  per¬ 
mit.  The  method  is,  therefore,  robust. 
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FIG.  17,  (a)  Distribution  of  minimum  distances  P{  fj  for  the  initial  set  of 
28  000  Si5  configurations,  (b)  Distribution  of  Tm  values  for  the  new  configu¬ 
rations  resulting  from  the  applications  of  Eqs.  (3)  and  (4)  to  Si5  structures 
computed  using  the  NN  fit  to  the  original  Si5  data  set.  (c)  New  distribution 
of  Tm  values  for  the  concatenated  set  of  configurations  resulting  from  lhe  Si* 
structures  obtained  in  the  original  trajectories  using  the  Tersoff  potential  and 
those  obtained  in  the  first  iteration  on  the  NN  surface. 


The  accuracy  of  the  neural  net  interpolation  is  excel  lent 
with  rms  deviations  of  NN  predictions  from  quantum  me¬ 
chanically  computed  energies  on  the  order  of 
0,09-0.36  kJ  mol-1  for  five-  and  six-atom  systems  with  mul¬ 
tiple  open  reaction  channels.  The  required  computational 
time  for  NN  fitting  is  on  the  order  of  10  min  to  16  h  for  the 
systems  considered  here.  There  is  virtually  no  human  time 
involved  in  the  procedure,  which  is  in  contrast  to  the  large 
human  effort  required  to  fit  arbitrarily  chosen,  parameterized 
functional  forms.  Moreover,  the  NN  fit  is  analytic  so  that 
there  are  no  spurious  discontinuities  in  higher  derivatives 
that  arc  often  produced  by  switching  functions  that  arc  em¬ 
ployed  to  connect  channel  potentials.70 

The  sampling  procedures  easily  allow  new  configura¬ 
tions  in  the  iterative  MD  calculations  to  be  identified  as  such 
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The  energies  and  force  fields  for  these  configurations  are 
computed  and  the  results  incorporated  into  the  total  database 
after  which  a  new  NN  is  fitted.  Moreover,  the  sampling  pro¬ 
cedures  serve  as  an  effective  convergence  lest  that  does  not 
require  the  MD  computation  of  the  reaction  dynamics.  When 
subsequent  MD  calculations  produce  very  few  new  configu¬ 
rations,  convergence  to  the  final  potential  surface  is  essen¬ 
tially  assured. 

The  NN  methods  provide  several  significant  advantages 
over  moving  interpolation  procedures54  employing  Shepard's 
interpolation41,42  or  least-squares  methods.57  K  These  in¬ 
clude  (1)  increased  interpolation  accuracy,  (2)  procedures  to 
increase  interpolation  accuracy  that  are  easily  implemented, 
(3)  the  flexibility  to  fit  the  force  fields  directly  with  no  imple¬ 
mentation  problems,  and  (4)  more  accurate  procedures  for 
initiating  the  method  with  self-starting  methods  readily 
available. 
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A  new  approach  involving  neural  networks  combined  with  molecular  dynamics  has  been  used  for 
the  determination  of  reaction  probabilities  as  a  function  of  various  input  parameters  for  the  reactions 
associated  with  the  chemical- vapor  deposition  of  carbon  dimers  on  a  diamond  (100)  surface,  The 
data  generated  by  the  simulations  have  been  used  to  train  and  test  neural  networks.  The  probabilities 
of  chemisorption,  scattering,  and  desorption  as  a  function  of  input  parameters,  such  as  rotational 
energy,  translational  energy,  and  direction  of  the  incident  velocity  vector  of  the  carbon  dimer,  have 
been  considered.  The  very  good  agreement  obtained  between  the  predictions  of  neural  networks  and 
those  provided  by  molecular  dynamics  and  the  fact  that,  after  training  the  network,  the 
determination  of  the  interpolated  probabilities  as  a  function  of  various  input  parameters  involves 
only  the  evaluation  of  simple  analytical  expressions  rather,  than  computationally  intensive 
algorithms  show  that  neural  networks  are  extremely  powerful  tools  for  interpolating  the 
probabilities  and  rates  of  chemical  reactions.  Wc  also  find  that  a  neural  network  fits  the  underlying 
trends  in  the  data  rather  than  the  statistical  variations  present  in  the  molecular-dynamics  results. 
Consequently,  neural  networks  can  also  provide  a  computationally  convenient  means  of  averaging 
the  statistical  variations  inherent  in  molecular-dynamics  calculations.  In  the  present  case  the 
application  of  this  method  is  found  to  reduce  the  statistical  uncertainty  in  the  molecular-dynamics 
results  by  about  a  factor  of  3.5.  ©  2005  American  Institute  of  Physics,  [DOI:  10.1063/1 .2131069] 


l  INTRODUCTION 

The  use  of  neural  networks  (NNs)  to  predict  an  outcome 
(or  the  output  results  as  a  function  of  a  set  of  input  param¬ 
eters)  has  been  gaining  wider  acceptance  with  the  advance  in 
computer  technology  as  well  as  with  an  increased  awareness 
of  the  potential  of  NN,1-7  A  neural  network  is  first  trained  to 
learn  the  underlying  functional  relationship  between  the  out¬ 
put  and  the  input  parameters  by  providing  it  with  a  large 
number  of  data  points,  where  each  data  point  corresponds  to 
a  set  of  output  and  input  parameters.  The  data  required  to 
train  a  NN  may  be  obtained  from  experimental  results  or  by 
computer  simulations.  For  example,  one  can  perform  ah  ini¬ 
tio  calculations  to  compute  the  interaction  potential  for  a 
system  of  atoms  as  a  function  of  internal  coordinates  for  a 
large  number  of  configurations,  and  then  these  data  can  be 
used  to  train  a  NN  to  provide  the  interpolated  potential  en¬ 
ergy  for  any  other  unknown  configuration, Recently,  for 
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multiatomic  systems  such  as  vinyl  bromide  and  a  Sis  cluster. 
Raff  et  a/.s  have  shown  that  such  a  procedure  leads  to  inter¬ 
polated  values  of  the  potential  with  a  high  level  of  accuracy. 

Sumpter  and  Noid16  demonstrated  the  use  of  NN  to  map 
the  vibrational  motion  derived  from  the  vibrational  spectra 
onto  a  potential -energy  surface  (PES)  with  relatively  high 
accuracy.  In  another  application,  Sumpter  et  ai 17  trained  a 
NN  to  learn  the  relation  between  the  phase-space  points 
along  a  trajectory  and  the  mode  energies  for  stretching,  tor¬ 
sion.  and  bending  vibrations  of  H202.  Likewise,  Nami  et 
a/.1H  demonstrated  the  use  of  NN  to  determine  the  TiOi 
deposition  rates  in  a  chemical- vapor  deposition  (CVD)  pro¬ 
cess  from  the  knowledge  of  a  range  of  deposition  conditions. 

In  view  of  the  success  achieved  in  obtaining  interpolated 
values  of  the  PESs  for  multiatomic  systems  using  a  NN 
trained  by  the  ah  initio  energy  values  for  a  large  number  of 
configurations,  it  is  reasonable  to  ask  if  we  can  successfully 
compute  the  interpolated  value  of  a  reaction  rate  or  reaction 
probability  for  a  chemical  reaction  using  a  NN  trained  by  the 
data  obtained  by  molecular-dynamics  (MD)  simulations.  To 
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be  specific,  we  consider  here  an  example  of  the  interaction  of 
a  carbon  dimer  C2  with  a  diamond  (100)  surface.  Due  to  its 
importance  in  the  growth  of  nanocrystal  line  diamond,  the 
study  of  the  C2 -diamond  surface  system  has  been  an  area  of 
considerable  interest,19-30  In  the  present  investigation,  we  ex¬ 
plore  the  validity  of  the  results  provided  by  a  trained  NN. 
For  the  training  of  the  NNs,  we  obtain  data  by  performing 
MD  simulations  to  compute  chemisorption,  scattering,  and 
desorption  probabilities  as  a  function  of  the  direction  of  the 
initial  velocity  vector  ( 0 ,  <£),  impact  parameter  (b)T  transla¬ 
tional  energy  (Etrans),  and  rotational  energy  (ErsH)  of  C2  for  a 
given  lattice  temperature  (see  the  next  section  for  a  definition 
of  these  parameters).  We  then  test  the  trained  NN  by  com¬ 
paring  the  predictions  provided  by  the  NN  with  MD  simula¬ 
tions  for  various  input  conditions. 

In  Secs,  il  and  III,  we  present  the  details  of  the  MD 
simulations  and  the  NN  fitting  procedure,  respectively.  The 
results  are  presented  and  discussed  in  Sec.  IV  and  are  sum¬ 
marized  in  See.  V. 

it,  MOLECULAR-DYNAMICS  SIMULATIONS 

Various  aspects  of  the  MD  simulations  presented  in  this 
investigation  are  the  same  as  those  used  in  our  previous  gas- 
surface  scattering  studies,31"37  A  total  of  324  atoms  are  used 
to  model  the  system.  Out  of  these,  282  atoms  of  diamond 
substrate  are  used  to  model  the  (100)  crystalline  face  with  40 
atoms  of  hydrogen  on  the  top  layer  of  the  diamond  surface 
and  2  atoms  in  the  C?  dimer.  The  BRENNERMP  code  used 
employs  the  potential  given  by  Brenner  el  at.  with  van  der 
Waals  interactions  incorporated  using  a  Lennard -Jones  (LJ) 
potential  The  (x,  v)  coordinates  of  the  atoms  on  the  top  three 
layers  are  as  shown  in  Fig.  1(a)  (for  atoms  of  top  layer,  z 
-0).  Each  of  the  carbon  atoms  on  the  top  layer,  except  the 
central  atom  and  boundary  atoms,  is  capped  by  a  hydrogen 
atom.  The  equations  of  motion  are  integrated  using  Gear's 
predictor- corrector  method39  with  a  time  step  size  At  of 
0,5  fs. 

The  results  of  a  given  trajectory  depend  upon  a  multi¬ 
tude  of  input  variables.  These  include  b,  6.  £mlt  E,ian<.  the 
initial  orientation  of  the  C2  dimer,  ihe  angle  defining  the  C2 
rotational  plane,  the  initial  C2  vibrational  energy  and  its 
phase,  the  temperature  of  the  system,  and  all  of  the  variables 
that  define  the  vibrational  phases  of  the  diamond  surface. 
Here,  8  denotes  the  angle  of  incidence,  i.c.,  the  angle  be¬ 
tween  the  direction  of  the  initial  translational  velocity  vector 
of  the  C i  dimer  and  the  perpendicular  on  the  surface  (Z 
direction).  The  impact  parameter  b  is  defined  as  the  distance 
between  the  location  of  the  central  atom  C  [see  Figs,  1(a) 
and  1(h)]  and  the  point  of  intersection  P  of  the  initial  veloc¬ 
ity  vector  and  the  diamond  surface,  as  shown  in  Fig  1(b).  $ 
represents  the  angle  between  the  line  CP  and  X  axis,  as 
shown  in  Fig.  1(b),  and  £fjll  and  E[ram  refer  to  the  rotational 
and  translational  kinetic  energies  of  the  dimer,  respectively. 

In  this  investigation,  wc  have  focused  our  efforts  on  the 
determination  of  the  dependence  of  probabilities  for  chemi¬ 
sorption,  scattering,  and  desorption  upon  b ,  8 ,  £rnt,  and 
£lr;ins.  The  initial  C2  vibrational  energy  is  set  equal  to  the 
zero-point  energy  and  the  temperature  of  the  lattice  is  main* 


Y 


<b> 

HG,  t,  (a)  The  projection  of  the  top  three  layers  of  carbon  atoms  of  dia¬ 
mond  ( 100)  Lattice  on  XV  plane:  first  layer  fO),  second  layer  (□).  and  third 
layer  {*).  The  central  mom  C  denotes  the  radical  site,  (b)  A  schematic 
diagram  to  depict  parameters  b  and  <fr,  C  denies  the  radical  site  and  P 
rc  presen  is  the  point  of  intersection  of  the  initial  C2  velocity  vector  and  the 
diamond  surface, 


tained  constant  at  600  K  using  the  Be  rend  sen  thermostat 
procedure,40  For  a  given  input  set  of  b .  8.  <f>,  ErtH>  and 
the  probabilities  of  different  reactions  are  determined  by  run¬ 
ning  50  trajectories  to  effect  the  averaging  over  the  remain¬ 
ing  variables. 

Ilf.  NEURAL  NETWORKS 

Using  MD  simulations,  we  first  computed  the  reaction 
probabilities.  Px(ai MD)  ( x=C ,  5,  and  D  which  correspond 
to  chemisorption,  scattering,  and  desorption,  respectively), 
for  a  given  set  of  input  parameters,  a=a{b.  8, 4> ,£„*,£<, anJ. 
By  running  such  calculations  for  different  values  of  <*,  we 
have  computed  N  sets  of  values  of  Px{ar.MD).  These  N  sets 
of  data  points  are  used  to  train  the  two-layer  neural  network,1 
as  shown  in  Fig.  2. 

For  training,  i.e,  for  the  determination  of  b\*\ 
and  ^T)  matrices,  we  used  the  same  procedure  as  described 
in  our  previous  workK  except  for  the  use  of  the  tan-sigmoid 
function  in  the  present  study  in  place  of  log -sigmoid  func¬ 
tion,  Further,  in  this  study,  to  obtain  the  best  fit,  wc  chose  the 
number  of  neurons  (n)  in  the  hidden  layer  to  be  50,  From 
such  training,  for  each  reaction,  we  determined  hl*\ 
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FIG.  2,  A  schematic  illustration  of  the  two- layer  neural  network  used  in  the 
present  study. 

W^\  and  h{f  matrices  of  dimension  (riX5),  (nX  l),  (I  x«), 
and  (1x1),  respectively.  These  matrices  can  be  used1  to 
compute  Pt  for  any  desired  value  of  a  using  the  following 
equations; 


P,(a;  NN)  =  +  b{{\ 

(1) 

where 

a\x)  =  tanhf/i^1]. 

(2) 

with 

ni’1  =  W^a  + 

(3) 

Here,  it*'**  and  a  represent  matrices  of  dimensions 
(n  X  1),  (n  X  l)f  and  (5xl),  respectively,  and  the  elements  of 
matrix  a  are  the  input  parameters  fr,  6 ,  &  £ra,  and  £ It 
may  be  added  that  for  Ihc  sake  of  belter  accuracy  of  ihe 
neural  network  at  the  lime  of  training,  we  have  linearly 
scaled*  the  input  and  output  parameters  in  the  range  of  -I  to 
1;  after  computing  PM(aiW)  using  Eq.  (1),  we  performed 
the  reverse  scaling  procedure  to  obtain  the  actual  F/aiNN), 

IV,  RESULTS  AND  DISCUSSION 
A,  Various  reactions 

We  noted  the  occurrence  of  the  following  reactions: 

*  +  C2  — 1 ►  *  -  Ci  (chemisorption)  [reaction  (Rj)], 

*  +  — *  *  4*  C 2  (scattering)  [reaction  (R2)], 

and 

*  +  C2  * - C 2  — *  *  +  C2  (adsorption  and  desorption) 

[reaction  (R3)]. 

The  bonds  not  specified  in  these  equations  remain  un¬ 
changed.  Here  *  refers  to  the  central  carbon  atom  on  the  top 
surface.  It  may  be  noted  that  no  hydrogen  atom  is  bonded 
wiih  this  central  carbon  atom,  ti  is,  therefore,  a  radical  site. 

In  addition  to  these  channels,  we  have  also  noticed  a 
reaction  in  which  C2  *s  inserted  into  two  surface  carbon  at¬ 
oms: 

[reaction  (R4)], 

Here  *'  and  *"  denote  two  neighboring  carbon  atoms  on  the 
diamond  surface,  and  H  denotes  a  hydrogen  atom  bonded 
with  ihe  carbon  atom.  All  other  bonds  remain  unchanged.  H 
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Time  steps 


FIG.  3.  (a)  The  potential  energy  (V)  of  the  system  (in  eV).  lb)  7  coord i mile 
of  the  center  of  mass  of  carbon  dimer  {in  A),  and  (c)  the  imemudear  dis¬ 
tance  (fl)  between  the  carbon  atoms  of  the  dimer  (in  A),  as  a  function  of  the 
integration  time  step  for  a  typical  trajectory  leading  to  chemisorption.  One 
time  step  corresponds  to  0,5  fs. 

has  been  noted  that  reaction  (R4)  occurs  with  very  low  prob¬ 
ability.  Therefore,  we  confine  our  attention  to  reactions 
(RiMRj)  only. 

The  potential  energy  of  the  system,  the  Z  coordinate  of 
the  center  of  mass  of  C2*  and  the  dimer  bond  length  as  a 
function  of  time  for  a  trajectory  undergoing  reaction  (R,)  are 
shown  in  Fig.  3.  Wc  note  a  drop  in  the  potential  energy  due 
to  chemisorption,  a  drop  in  the  value  of  Z,  and  a  modification 
in  the  vibrational  frequency  and  amplitude  of  C2  as  the 
chemisorption  takes  place.  The  plots  for  a  trajectory  under¬ 
going  reaction  (R3)  are  shown  in  Fig.  4.  From  Fig,  4{b)  we 
note  that  Z  first  decreases,  reaches  a  minimum,  remains 
nearly  constant  for  some  lime,  and  then  begins  to  increase 
due  to  reverse  journey  of  C2  If  we  compare  the  potential 
variation  given  by  Figs,  3(a)  and  4(a),  we  clearly  note  the 
difference  that  in  the  case  of  reaction  (R^)  ihc  change  in  the 
potential  energy  of  the  system  is  very  small  compared  to 


Time  steps 


FIG.  4.  fa)  The  potential  energy  (I/)  of  Ihe  system  {in  eV)  and  (b)  Z  coor¬ 
dinate  of  the  center  of  mass  of  carbon  dimer  (in  A),  as  a  function  of  ihe 
integration  lime  slep  for  a  (ypical  trajectory  leading  to  reaction  (R^  One 
lime  step  corrcs ponds  10  0.5  fs. 
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FIG.  5,  The  probabihiies  of  m  event  (ehcmitiorpiion/scattering/tJesoiptioii)  given  by  neural  network  vs  the  corresponding  probabilities  given  by  molecular 
dynamics  for  [raining  and  icsting  data  sets:  [(a)  and  (b>]  chemisorption,  [(c)  and  (d)j  scattering,  and  ((e)  and  (f)]  desorption. 


reaction  (R,).  For  reaction  (R2),  we  also  obtain  curves  (not 
shown)  similar  to  those  in  Fig,  4*  except  that  Z  starts  rapidly 
increasing  after  reaching  its  minimum  value. 

B.  Training  and  testing  of  the  neural  networks 
1 .  Training  by  a  single  set  of  data 

We  have  computed  the  probability,  Px(a,MD)  [jc=C  S , 
and  D  corresponding  to  reactions  (Rj)~(R3)]t  of  a  reaction 
for  a  given  set  of  input  parameters.  a-a(bm  £tra(U), 

by  running  50  trajectories  during  which  we  average  over 
other  variables  including  the  initial  phases  of  vibrahon  of  C2 
and  the  lattice,  and  the  initial  orientation  of  C2  bond  vector. 
The  vibrational  energy  of  C2  and  temperature  of  the  lattice 
are  kept  fixed  in  all  the  calculations  reported  in  this  work. 
We  computed  PJ^aMD)  for  1900  different  values  of  a.  Out 
of  these  1 900  data  points.  1 500  data  points  are  used  for  train¬ 
ing,  200  for  validation,  and  200  for  testing.  We  used  the 
training  set  to  train  a  NN  to  predict  the  probabilities  of  reac¬ 
tions  for  all  values  of  at,  within  the  range  of  a  of  the  training 
set.  We  also  employed  an  early  stopping  procedure  and  vali¬ 
dation  data  set  to  prevent  overfilling.41 

The  initial  success  of  the  NN  training  is  shown  in  Fig.  5 
(top  plots)  where  we  plot  1500  values  of  PJf*;NN)  given  by 
the  NN  versus  the  corresponding  probabilities.  PJor.MD). 
given  by  the  trajectory  calculations  which  were  used  to  train 
the  neural  network.  Next,  for  200  values  of  a,  we  compute 
P,(a;  NN)  using  Eqs.  ( 1 )— (3)  and  compare  these  values  with 
the  corresponding  values  given  by  MD  simulations  (see  bot¬ 
tom  plots  of  Fig.  5).  The  results  of  the  testing  set  are  very 
similar  to  those  for  the  training  set,  which  show  good  gener¬ 
al  i/ation  capability  of  the  network. 

For  the  purpose  of  the  following  discussion,  it  is  appro¬ 
priate  to  describe  the  relationships  among  three  related  reac¬ 
tion  probabilities;  P((ar;MD).  P^oriNN),  and  Pt{a),  (For 


brevity,  when  the  specification  of  some  labels  is  not  essen¬ 
tial,  we  may  ignore  one  or  more  of  the  labels  among  .v.  or, 
MD,  and  NN).  Px(a;MD)  is  the  reaction  probability  for  pro¬ 
cess  Jt  computed  from  the  trajectory  results  using 

Px(*'MV)  =  NJNt>  (4) 

where  Nx  is  the  number  of  trajectories  that  resulted  in  reac¬ 
tion  x  and  Nr  is  the  total  number  of  trajectories  computed. 
/%(£*)  is  the  underlying  true,  but  unknown,  classical  reaction 
probability  for  process  ,v  on  the  potential -energy  surface  used 
in  the  calculations.  These  two  probabilities  are  related  by 

Px(a)=  lim  P,(a;MD),  (5) 

NT —  * 

When  Nr  is  finite,  there  will  be  a  random  statistical  fluctua¬ 
tion  present  in  P^aiMD)  that  will  cause  its  value  to  deviate 
from  Pt(a)<  This  fluctuation  will  be  such  that  ^“95%  of  the 
time,  we  expect  to  have 

P»  -  2 A  =£  p>;MD)  Px(a)  +  2A,  (6) 

where42 

i*[^(a)(l-/>J(Qt)}/ATrl1'2.  (7) 

Finally,  PJo^NN)  is  the  reaction  probability  predicted  by 
the  NN  for  process  x  [see  Eq.  ( 1 )]. 

As  mentioned  earlier.  Figs.  5(a)-5(f)  show  plots  of 
Pt(MD)  vs  P*(NN)  for  chemisorption,  scattering,  and  de¬ 
sorption.  respectively,  with  NT=50  in  the  training  set  (top 
plots)  and  testing  set  (bottom  plots)  If  the  agreemeni  be¬ 
tween  Pa(MD}  and  P*(NN)  were  to  be  perfect,  all  points 
would  lie  on  the  45c  lines  shown  in  the  plots.  In  the  analysis 
that  follows,  wc  shall  demonstrate  that  most  of  the  deviation 
of  the  plotted  points  from  the  45°  line  is  due  to  the  statistical 
variations  present  in  P*(MD)  rather  than  to  inaccurate  inter- 
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FIG.  6.  The  expected  statistical  spread  of  the  probabilities  Pf(cr;MD)  as 
given  by  Eq.  (6)  vs  (he  true  probabilities  Pt{a}. 

po l aii on  of  the  neural  net.  This  conclusion  is  based  on  the 
variation  of  Pt(a;MD)  with  Px(a)  computed  using  Eqs,  (6} 
and  (7).  This  result  is  shown  in  Fig.  6. 

A  comparison  of  the  spread  of  data  From  the  ideal  45° 
line  in  Fig,  5(a)  or  Fig.  5(b)  with  that  shown  in  Fig,  6  shows 
that  they  match;  95%  of  the  points  in  these  curves  are  within 
distances  of  0.08,  0.10,  and  O.t  lT  respectively,  from  the  45° 
line.  This  shows  lhat  Ihc  expected  statistical  spread  of 
/**(<*;  MD)  values  is  very  close  to  that  observed  in  the  plots 
of  Pr(a;MD)  vs  P,(ar;NN)  shown  in  Fig,  5.  In  effect 
Pjr(cr;NN)  is  filling  Px(a)  and  not  the  statistical  fluctuations 
in  PJccMD),  This  observation  indicates  that  neural  net  fit¬ 
ting  of  ihe  MD  results  provides  a  rapid  and  effective  means 
of  averaging  out  the  statistical  fluctuations  present  in 
Px(a;MD)  when  Nr  is  not  very  large. 

The  NN  with  an  early  stopping  procedure41  ,A}  tends  to  fit 
the  trend  and  not  the  statistical  fluctuations.  This  observation 
is  not  a  new  one.  It  has  been  shown  that  the  number  of 
training  iterations  corresponds  to  the  complexity  of  the  neu¬ 
ral  network  function  44  Therefore,  as  training  continues,  the 
complexity  of  the  network  function  increases  and  it  becomes 
more  likely  lhat  ihe  network  will  begin  to  fit  statistical  fluc¬ 
tuations,  rather  than  the  underlying  Line  probability  function. 
By  determining  when  the  error  on  the  validation  set  begins  to 
increase,  we  can  stop  the  training  at  an  optimal  point,  before 
overfilling  occurs.  It  is  trivial  to  visualize  that  wilh  overfit¬ 
ting.  in  Fig.  5.  the  spread  in  the  training  plots  would  shrink 
and  that  in  the  testing  plots  would  increase. 

To  check  further  that  the  spreads  in  the  plots  of  Fig.  5  are 
mainly  due  to  the  statistical  variability  associated  with  the 
results  of  the  MD  calculations,  we  compute  /\(or;MD)  by 
running  500  trajectories  instead  of  50,  for  each  value  of  of. 
This  is  expected  to  decrease  the  statistical  fluctuations  in 
/%(**:  MD)  by  a  factor  of  3,2.  The  plots  of  such  probabilities, 
Pt(MD)  vs  P1(NN),  are  shown  in  Fig.  7.  It  may  be  men¬ 
tioned  that  the  Pt(NN)  values  have  been  determined  here  by 
using  the  same  sets  of  Wj,  />,,  and  b2  as  we  used  to 
obtain  Fig,  5.  The  decreased  spread  of  the  points  around  the 
45d  line  suggests  that  the  same  NN,  which  was  obtained  by 
fining  the  less  accurate  MD  data,  is  in  belter  agreement  with 
the  more  accurate  MD  data.  This  point  again  indicates  that 
the  NN  fitting  procedure  involving  early  stopping  fits  the 


FIG.  7,  The  probabilities  of  an  even!  tchcinisorpiion/scattenng/desoipuon) 
given  by  neural  networks  vs  ihe  corresponding  probabilities  given  by  mo¬ 
lecular  dynamics  using  sets  of  500  trajectories:  (a)  chemisorption,  (b)  scat¬ 
tering,  and  (c)  desorption. 

trend  of  the  input  data  and  not  the  statistical  fluctuations 
associated  with  the  input  data.  This  implies  that  statistical 
error  present  in  the  MD  results  may  be  most  conveniently 
reduced  using  NNs  rather  than  by  computing  additional  tra¬ 
jectories. 

A  comparison  of  the  spread  in  Figs,  5{b)  and  7(a)  shows 
that  there  has  been  a  decrease  in  the  spread  by  a  factor  of  2.0, 
The  reduction  in  the  spread  by  a  factor  of  2,0,  instead  of  3,2, 
indicates  the  effect  of  error  (e^)  associated  with  the  neural 
network  fitting. 

To  estimate  the  deviation  associated  with  the  NN  fitting, 
we  employed  the  same  1900  data  points  50  times  to  get  50 
different  trained  NN  such  that  each  network  has  been  trained 
with  different  random  initial  weights.  In  Lwo-sigma  limit,  the 
variation  of  such  50  values  of  P^or.NN)  from  the  mean 
value  for  each  of  the  1900  values  of  a  has  been  determined. 
The  average  of  such  variations  is  found  to  be  equal  to  0,04. 

Thus,  we  may  consider  two  limiting  cases.  When  the 
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FIG.  8.  The  variation  of  chemisorption  probability  (Pf).  scattering  probabil¬ 
ity  [P$),  and  desorption  probability  (£,>)  as  a  function  of  impact  parameter 
[bl  for  £tfm*0.06  eVH  £,*=0.052  eV.  0=  17*,  and  W.  The  triangles 
(A)  joined  by  the  line  denote  the  NN  ensemble  predictions  and  the  circles 
(•')  represent  the  MD  data.  The  error  bars  associated  with  the  MD  points 
correspond  to  one-sigma  limit. 

statistical  error  (eMD)  in  the  input  MD  data  is  very  small  as 
compared  to  we  would  expect  some  spread  in  figures 
similar  to  Figs.  5{a)-5(f)  due  to  error  When  eMD  is  very 
large  as  compared  to  eNlv,  then  the  probabilities  predicted  by 
the  NN  are  expected  to  be  more  accurate  than  those  given  by 
MD,  and  the  spread  of  points  from  the  45°  line  in  a  figure 
similar  to  Fig.  5  would  be  dose  to  The  results  shown  in 
Figs.  5(a>— 5(f),  6.  and  7  show  that,  for  the  present  applica¬ 
tion,  we  have  eMC>^erNN. 

2 Ensemble  (committee)  of  networks 

To  improve  the  network  accuracy,  we  used  the  1900  data 
points  previously  described  to  train  an  ensemble  of  / 
networks.45  In  our  computations,  wc  used  /=5Q.  Each  net¬ 
work  in  the  ensemble  was  trained  using  a  different  random 
partitioning  of  the  data  into  1600  training  data  and  300  vali¬ 
dation  data.  In  addition,  each  network  was  trained  with  dif¬ 
ferent  random  initial  weights.  Thus,  for  /  networks  in  the 
ensemble,  we  obtain  /  sets  of  h\x\  and  matri¬ 
ces,  After  all  networks  in  the  ensemble  have  been  trained,  we 
apply  the  same  input  to  each  network  and  average  all  the 
responses.  The  results  obtained  using  this  method  are  given 
in  Sec.  IV  B  3.  The  ensemble  average  produces  a  more  ac¬ 
curate  response  than  any  one  member  of  the  ensemble,45  It 
averages  out  the  variations  in  individual  network  responses 
due  to  randomness  in  sampling  of  the  data  and  randomness 
in  the  choice  of  initial  weights  and  biases  provided  by  train¬ 
ing. 

The  results  obtained  for  the  C2-di amend  (100)  surface 
system  are  examples  of  the  advantage  provided  by  analysts 
of  the  daia  using  NN  ensembles.  Equations  (6)  and  (7)  show 
that  the  two-sigma  limit  statistical  uncertainty  in  values  of 
P,(«; MD)  is  ±28,3%  when  Px{a)  is  0.50  and  NT  is  50.  In 
contrast,  the  corresponding  statistical  uncertainty  present  in 

Downloaded  19  Dec  2005  to  139.76  79  49  Redistribution  subject 


FIG.  9,  Same  as  Fig.  B  exccpi  that  these  curves  show  variation  with  for 
£M=Q  Q52.eV1  b=  1.0  A,  8-  1 1°,  and  10*  (left  curves),  and  that  with 
£ftll  for  E,„Bi“0.06  eV,  b=  1,0  A.  0=  IV,  and  =  1 10°  (right  curves). 


the  results  obtained  using  the  NN  ensemble  is  about  ±8%, 
This  statistical  error  reduction  of  a  factor  of  about  3,5  is 
achieved  with  very  little  additional  computational  effort, 

3.  Dependence  of  reaction  probabiiities  on  input 
parameters 

A  major  advantage  of  NNs  can  be  realized  by  determin¬ 
ing  the  probability  of  Pr(ar;NN)  corresponding  to  any  value 
of  a  within  the  range  of  b>  ft  0,  and  for  which  the 
NN  has  been  trained.  For  example,  in  almost  negligible  time 
{as  compared  to  the  time  required  to  run  tens  of  thousands  of 
trajectories)  we  can  find  the  dependence  of  Ft(cr;NN)  on  any 
variable,  say  bt  for  any  fixed  value  of  ft  0,  Emt.  and  E^m.  As 
an  example.  Figs.  8(a)— 8(c)  show  such  variations  of  Pc*  P$, 
and  P0  as  a  function  of  h  for  0=  17°,  0=310°,  E ^ 
=0.052  eV,  and  E^-0.06  cV  given  by  the  NN  ensemble. 
For  comparison,  the  results  obtained  by  trajectory  calcula¬ 
tions  are  also  shown  in  the  figures.  We  note  that  the  agree¬ 
ment  between  the  NN  and  MD  results  is  very  good.  Simi¬ 
larly,  the  computed  MD  results  and  NN  predictions  of 
variation  of  Fc.  Fs,  and  Pp  as  a  function  of  Etnm  and  Em  for 
fixed  values  of  other  parameters  are  compared  in  Fig.  9,  and 
the  results  shown  in  Fig.  10  give  the  corresponding  curves 
when  0  and  0  are  varied.  All  these  curves  show  that  the 
agreement  between  the  MD  results  and  NN  predictions  is 
very  good. 

it  is  trivial  to  exhibit  the  utility  of  such  a  trained  NN.  As 
an  example,  in  Fig.  1 1  we  show  the  variation  of  P0  F$,  and 
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PIG,  10.  Same  as  Fig.  8  except  thill  these  curves  show  variation  with  ft  for 
£„„t=O.I24.  Em = 0,05 2  eV,  A  =  1.0  A,  and  <6=  1 10s  (lefl  curves),  und  dial 
with  <t>  for  £lr<1B=0tl6  eV,  £„,=0.052  eV.  *=1.0  A,  and  S=ll'  (right 
curves), 

PD  with  angle  9  predicted  by  the  NN  ensemble  for  different 
values  of  impact  parameters  for  some  fixed  values  of  <£,  Erol, 
and  EKini.  It  is  easy  to  visualize  that  any  number  of  such 
curves  for  the  dependence  of  F0  PSw  and  PD  on  any  of  the 
parameters  can  similarly  be  obtained  in  a  CPU  time  negli¬ 
gible  compared  to  the  time  required  to  obtain  the  same  re¬ 
sults  by  explicit  MD  calculations, 

V*  SUMMARY  AND  CONCLUSIONS 

We  have  computed  the  probabilities  of  various  reactions 
associated  with  the  chemical- vapor  deposition  (CVD)  of  car¬ 
bon  dimers  on  a  diamond  (100)  surface  using  MD  simula¬ 
tions*  The  probabilities  of  chemisorption,  scattering,  and  de¬ 
sorption  asafunctionofinput  parameters  (rotational  energy, 
translational  energy,  angles  9  and  associated  with  the  in¬ 
cident  direction  of  the  initial  velocity  vector  of  the  carbon 
dimer,  and  impact  parameter)  have  been  employed  to  train 
and  test  the  neural  networks  (NNs).  The  results  of  the  study 
can  be  summarized  as  follows, 

(1)  By  using  a  large  number  of  data,  the  values  predicted 
by  the  trained  NN  arc  found  to  be  in  very  good  accord 
with  those  provided  by  MD, 

(2)  l\  has  been  noted  that  after  training  the  NN,  the  deter¬ 
mination  of  the  interpolated  probabilities  as  a  function 
of  the  input  parameters  involves  only  the  evaluation  of 
simple  analytical  expressions.  Thus,  the  present  study 
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6  (degree) 


FIG.  II.  The  variation  of  chemisorption  probability  [Pc},  scattering  prob¬ 
ability  (Ps).  and  dcsorpiion  probability  tP0)  predicted  by  the  NN  ensemble 
as  a.  function  of  angle  ft  for  different  values  of  impact  parameter:  1,0 f*},  1,5 
<*).  2.0  (A).  and  2.5  A  (fl).  for  E^-0,06,  E^-0.052  eV,  and  4>=  I IQ*. 

shows  that  interpolation  by  using  a  NN  can  signifi¬ 
cantly  reduce  the  burden  of  performing  a  large  number 
of  expensive  MD  calculations. 

(3)  It  has  been  observed  that  the  NN  fitting  procedure  in¬ 
volving  an  early  stopping  procedure  fits  the  trend  of  the 
input  data  and  not  the  statistical  fluctuations  associaled 
with  tf^Jinput  data.  Thus,  NNs  can  also  provide  a  com¬ 
putationally  convenient  method  of  averaging  out  statis¬ 
tical  variations  inherent  in  the  MD  calculations. 

These  observations  thus  show  that  NN  can  be  an  ex¬ 
tremely  powerful  tool  for  interpolating  the  probabilities  and 
rates  as  a  function  of  different  parameters  in  various  chemi¬ 
cal  reactions. 
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Molecular  dynamics  simulations  of  silicon  melling  reported  in  ihe  literature  using  Tcrsoff  s  potential  give 
melting  temperatures  about  50%  higher  than  the  experimental  value.  To  address  this  discrepancy,  we  have 
proposed  a  modification  to  the  values  of  the  parameters  of  the  Tersoff  potential.  This  modification  involves  a 
change  in  the  magnitude  of  3  of  the  12  parameters  in  the  Tcrsoff  potential  as  an  effective  means  for  bringing 
down  the  melting  point  close  to  the  experimental  value.  The  melting  point  is  determined  by  performing  Monte 
Carlo  simulations  using  the  empirical  void  -  nucleated  melting  procedure.  In  addition  to  an  agreement  of  the 
computed  melting  poim  with  the  experiment,  the  modified  parameters  also  bring  the  density  of  the  liquid  and 
the  solid  at  the  melting  point  into  good  agreement  with  the  experiment  without  significantly  altering  the  density 
and  energy  values  of  the  solid  crystal  at  room  temperature.  The  coordination  number  and  specific  heat  of  the 
liquid  are  also  found  to  be  in  better  agreement  with  the  experiment  when  the  modified  set  of  parameters  is 
used.  A  comparison  of  density  over  a  wide  range  of  temperatures  shows  that  the  density  of  the  solid  predicted 
by  the  Tcrsoff  potential  with  the  modified  parameters  is  larger  than  that  given  by  the  unmodified  parameters. 
This  difference,  however,  is  not  appreciable  at  room  temperature;  it  increases  with  temperature  and  is  about  I  % 
at  the  melting  point.  The  computed  cohesive  energy  of  the  solid,  the  point  defect  energy  corresponding  to  a 
vacancy,  and  the  surface  energy  values  for  (100)  and  (HI)  surfaces  are  also  found  to  be  nearly  the  same  for  the 
modified  as  well  as  the  unmodified  parameters, 
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L  INTRODUCTION 

The  exploration  of  empirical  intermolecuJar  interaction 
potentials  for  a  system  of  silicon  atoms  has  been  a  subject  of 
considerable  interest,1-23  One  of  the  most  popular  potentials 
for  silicon  is  Ihe  Tcrsoff  potential1'4  It  has  been  shown  to 
explain  various  properties  of  silicon  crystals.  However,  Cook 
and  Clancy23  have  found,  and  Yoo  et  al1A  have  recently  con* 
firmed,  that  the  Tersoff  potential4  predicts  the  melting  point 
to  be  nearly  50%  higher  than  experiment.  This  observation 
suggests  the  necessity  of  introducing  some  modification  in 
the  parameters  of  the  Tcrsoff  potential  that  will  provide  bet’ 
ter  agreement  of  ihe  predicted  melting  point  with  expcii- 
menl. 

It  is  well  known  that  molecular  dynamics  (MD)  simula¬ 
tions  of  melting  of  a  perfect  crystal  with  periodic  boundary 
conditions  produce  superheating  due  to  the  lack  of  free  sur¬ 
face  and  defects.  A  large  number  of  computational  studies 
show  that  the  mechanical  melling  temperature  at  which  a 
crystal  without  any  defects  and  free  surface  melts  is  about 
20%  higher  than  the  true  Lhermodynamic  melting  point  Tm. 
The  ratio, 

f~Tjr,  (i) 

is  found  to  be  —0.8  for  a  large  number  of  systems.  For 
example,  the  computed  values  off  for  Ar  over  a  wide  range 
of  pressure  (0-53  GPa)  is  found  to  be  0.85 ±0.03, 25-37  and  in 
the  range  0.77-0,81  Tor  Ne.2H  Cu.39  CH3N02.30  ammonium 
nitrate,31  and  ammonium  dinilramide.32  Using  kinetic  analy¬ 
sis  of  the  homogeneous  nucleation  behavior  for  melting  of 
superheated  crystals,  Lu  and  Li33  found  /  in  the  range  0,82- 


0.85  for  Ag,  Afi  Atfi  Co,  Cu,  Fe,  Mm  Ni,  Pd,  Pi.  and  Sb,  and 
lower  values  of  /for  metals  with  low  melting  points,  i.c,, 
0.78  for  Sn  and  0,77  for  Pb,  It  is,  therefore,  of  interest  to 
determine  if  the  computed  value  of  /  for  a  covalent  crystal, 
such  as  Si  also  falls  in  the  same  range. 

Another  point  of  interest  in  connection  with  the  melting 
of  silicon  is  ihe  testing  of  an  empirical  method,  namely,  void  - 
nucleated  melting,  for  the  determination  of  the  melting  poim. 
The  void-nucleated  method  for  determining  Tm  is  the  result 
of  empirical  observations  in  various  studies; 25-34  Detailed  in¬ 
vestigations  of  defect-nucleated  melling  of  covalent  and  me¬ 
tallic  solids  have  been  reported  by  Phiilpot  et  aiu  and 
Lutsko  et  ai 29  The  MD  simulations  of  Lutsko  et  aL  of  a 
lattice  containing  nearly  1000  atoms  of  copper  show  that  an 
ideal  crystal  melts  al  a  temperature  over  200  K  above  the 
thermodynamic  melting  temperature.  Solca  et  a/.2528  studied 
the  void- nucleated  melling  method  to  determine  the  melting 
point  of  Ar  and  Ne,  They  noted  that  with  the  increase  in  the 
void  size,  the  melting -temperature- versus- void -size  curve 
first  exhibits  a  decrease  and  then  attains  a  plateau  region  that 
corresponds  to  the  true  melting  temperature,  and  for  a  void 
size  larger  than  that  corresponding  to  the  plateau  region,  the 
melting  temperature  plummets. 

For  Ar,  at  various  values  of  pressure  ranging  from 
0  to  53  GPa,  Agrawal  et  aL2t  have  confirmed  this  observa¬ 
tion  by  comparing  the  melting  poim  so  determined  with 
those  computed  by  Zha  et  ol*5  using  a  thermodynamic 
method  on  the  same  potential.  By  simulating  three  types  of 
voids  in  this  study,26  it  has  also  been  noted  that  melting  tem¬ 
perature  depends  on  the  number  of  atoms  removed  and  not 
upon  the  number  of  voids  or  the  shape  of  the  voids,  tn  a 


!  098-01 21/2005/721  1 2J/1 2520fi{  1 1  J/S23.00 


1 25206-1 


©2005  The  American  Physical  Society 


206 


AGRAWAL.  RAFF,  AND  KOMANDUKl 


PHYSICAL  REVIEW  R  72,  125206  12005) 


subsequent  study  on  the  melting  of  mtromethanc,  Agrawal  et 
d/.30  obtained  good  agreement  between  the  melting  point 
computed  from  the  plateau  region  in  the  mcl  ting¬ 
le  mpe  rat  ure^  versus- void -size  curve  and  that  determined  by 
the  method  based  on  the  equilibration  of  a  mixture  of  liquid 
and  sol  id. M 

Besides  Ar,  Ne.  and  CH3NO2,  the  void- nucleated  melt¬ 
ing  method  has  also  been  tested  for  CH3COOH,37  cycloiri- 
methylcne  trinitraminc  (RDX).38  1,3,3-lrimtroazetidine 
(TNAZ),38  ammonium  nitrate,31  and  ammonium  dinitra- 
mide.32  The  method,  however,  is  empirical  in  lhat  there  is  no 
theoretical  basts  to  validate  the  assumption  that  the  melting 
temperature  in  the  plateau  region  is  the  true  melting  poinh 
Therefore,  it  may  be  interesting  to  see  if  this  void-nucleated 
melting  method  is  applicable  for  a  covalent  system,  such  as 
Si  where  we  have  the  results  of  other  studies2324  for  com¬ 
parison. 

In  this  paper,  we  report  the  results  of  computations  of  the 
melting  point  of  silicon  using  the  Tersoff  potential4  and  the 
void-nucleated  method.  The  results  provide  additional  infor¬ 
mation  related  to  the  validity  of  the  void-nucleated  method 
and  the  value  of  /  for  Si.  In  addition,  the  study  results  in  a 
modification  of  the  parameters  of  the  Tersoff  potential4  lhat 
gives  a  melting  point  in  agreement  with  experiment. 


TABLE  1,  The  modified  and  unmodified  parameters  of  the  Ter¬ 
soff  potential. 


Tersoff 

parameters 

Tersoff-4' 

Tersoff-ARK 

A  (eV) 

1.8308  X  I G3 

1.8308  X  101 

B  (eV) 

4.71 18  x  10* 

4.7 1 1 8 x  10* 

k  (A‘J) 

2.4799 

2,4799 

&  (A-1) 

1.7322 

1.7322 

0 

1,1  X  IQ'* 

U5x  10  * 

n 

0.78734 

0,98# 

c 

1,0039  X  105 

1,0039X10* 

d 

16.217 

16,2t7 

h 

-0.598  25 

-0J4S  25 

R  (A) 

2.7 

2.7 

5(A) 

3.0 

3.0 

X 

L0 

L0 

^Reference  4, 


Tersoff-2,  and  Tersoff- 3  potential,  respectively.  The  nota¬ 
tions,  Tersoff- 2  and  Tersoff- 3  are  consistent  with  those  used 
by  Balamane  et  at?9 


II.  COMPUTATIONAL  DETAILS 
A.  Force  field 


We  have  investigated  the  Tersoff  potential 4  which  is  ex¬ 
pressed  in  terms  of  12  parameters  as  follows: 

Wr=  <1/2)2  Vy. 

>*t 

(2) 

where 

V(J  =fc(ri])[A  exp(-  Krh)  -  btjB  cxp(- 

(3) 

X(l  +  (?%rU2\ 

(4) 

(,i~  2  fMik)g(oijk). 

(5) 

EL  Monte  Carlo  simulations  and  melting 
/,  NPT  simulations 

Monte  Carlo  (MC)  simulations  at  constant  pressure  (P) 
and  constant  temperature  (7)  for  a  fixed  number  of  atoms 
(N)  as  described  in  Ref.  40  are  performed  to  simulate  melt¬ 
ing.  A  3  x  3  x  3  supercell  of  Si  crystal  having  216  atoms  has 
been  considered.  The  cubic  simulation  box  has  an  edge 
length  of  L  Periodic  boundary  conditions  have  been  used.  At 
every  cycle  during  the  MC  walk,  N+ 1  moves  were  at¬ 
tempted.  These  correspond  to  a  random  move  of  each  of  N 
atoms  and  a  random  change  in  the  size  of  the  simulation  box. 
The  coordinates  and  box  size  have  been  varied  according  to 
the  following  standard  equations  given  in  Ref.  40: 

f^(new)  =  <^(old)  +  (2£|  -  1  )(&?)„,„,  (8a) 


=  l  +  c2fd2  -  e2/[d2  +  (/1  -  cos  8jjk)2 ],  (6) 


and 


fMih 


l  for  r{j<Rf 

|  1  costTrfr^  -  R)/(S  -  /?)]  for  R  <  rtj  <  S, 
0  for  rif  >  5, 


(7) 


The  parameters  of  the  potential,  as  given  in  Ref.  4  arc  listed 
in  Table  I. 

For  the  purpose  of  discussion,  we  shall  denote  the  above- 
mentioned  potential  as  Viv  or  the  Tcrsoff-4  potential  to  dis¬ 
tinguish  it  from  other  similar  potential  functions  with  differ¬ 
ent  sets  of  parameters  given  by  Tersoff,1-3  Wc  denote  those 
given  by  Tersoff  in  Refs.  1-3  by  Vj,  V|j,  and  Vm  or  Tersoff- 1 . 


L(new)  -L(old)  +  (2fe  -  (8b) 

where  and  are  random  numbers  chosen  uniformly  in  the 
interval  0  to  l,  and  {A^)maJl  and  (A L)^  are  the  maximum 
step  sizes  for  the  translation  of  the  Cartesian  coordinate  ( q ) 
of  an  atom  and  box  length  (L),  respectively.  These  maximum 
displacements  have  been  determined  to  obtain  an  acceptance 
ratio  of  about  50%  from  the  total  number  of  attempted 
moves  and  their  values  were  (A</)majl=0J5  A  and  (A L)max 

=0.13  A. 

A  walk  or  a  change  made  in  the  box  size  is  accepted,  if 
either  W<0  or 


exp(-  Wik8T]  >  f,  (9) 

otherwise  the  move  is  rejected.  En  the  above  relation,  kfi  is 
Boltzmann's  constant,  £  is  a  random  number  chosen  uni- 


1 25206-2 


207 


MONTE  CARLO  SIMULATIONS  OF  VOID-NUCLEATED... 


PHYSICAL  REVIEW  8  72.  125206  (2005) 


formly  in  Ihc  inierval  0  to  I,  and  W  is  given  by 

W  =  P(unew  -  vlM)  +  ( VKVI  -  VlM)  +  NkBT  )n(u„]j/t»n(.w), 

(10) 

Here,  the  subscripts  new  and  old  have  the  usual  meaning,  u 
denotes  the  volume  of  the  box,  V  represents  the  total  inter¬ 
action  energy  of  the  system,  and  P  and  T  signify  the  external 
pressure  and  temperature  at  which  the  calculations  were 
made.  When  a  move  is  rejected,  the  properties  corresponding 
to  the  previous  configuration  were  included  in  the  averages, 
A  new  configuration  was  then  generated  from  this  previous 
configuration. 


Temperature 


2,  Creation  of  voids  and  gradual  heating 

Starting  from  the  ideal  configuration  at  7=0  K,  an  equili¬ 
brated  configuration  at  7=500  K  and  P-  1  atm  is  obtained 
by  running  10  000  cycles  of  Monte  Carlo  simulations.  By 
successive  simulations,  each  for  10  000  cycles,  the  equili¬ 
brated  configurations  at  temperatures  1000,  1500 .  and 

3000  K  have  been  obtained.  All  the  simulations  have  been 
run  at  P=  1  atm.  From  this  set  of  configurations,  the  closest 
configuration  at  a  temperature  lower  than  T„  is  simulated  for 
10  000  cycles  to  obtain  an  equilibrated  configuration  at  tem¬ 
perature  Ttr  After  equilibrating  the  system  at  temperature  7P, 
jYV(m1  atoms  have  been  removed  out  of  the  total  Ni}  atoms. 
The  locations  of  these  Nvaiti  atoms  have  been  chosen  ran¬ 
domly  with  the  constraint  that  the  minimum  distance  be¬ 
tween  any  two  is  greater  than  rmjn,  where  rmirt  is  chosen 
nearly  as  large  as  possible.  We  choose  this  number  /V¥Ult)  to 
signify  the  void  size. 

After  the  creation  of  voids,  this  system  of  Af=tf„-A/y{jid 
atoms  is  again  equilibrated  at  temperature  T0  and  pressure  P 
by  running  MC  simulations  for  nn  cycles.  The  heating  of  the 
system  from  initial  tempera  Lure  Ta  is  achieved  by  uniformly 
changing  the  temperature  by  47=0.00 1  K,  after  each  cycle 
of  /V+l  moves.  Thus,  the  temperature  at  the  mh  cycle  of 
moves  is 


r=Trt  +  (H~/OA7>  (II) 

The  gradual  heating  of  the  system  in  the  MC  simulations 
has  been  achieved  by  redefining  the  desired  temperature  in 
Eqs.  (9)  and  (10)  during  constant  number-pressure- 
temperature  (NPT)  simulations.  This  method  of  gradual  heat¬ 
ing  based  on  redefining  the  desired  temperature  in  NPT 
simulations  has  also  been  employed  earlier  for  the  moiling 
studies  of  Ar6  and  nitro methane 30  by  MD  simulations.  The 
success  of  this  procedure  has  been  tested  in  those  studies26'30 
by  monitoring  the  temperature  and  pressure  as  a  function  of 
integration  lime;  there  it  has  been  found  that  pressure  re¬ 
mains  constant  and  the  temperature  changes  according  to  the 
desired  input  temperature.  Here  in  MC  simulations  we  do  not 
have  the  kinetic  energy  to  monitor  the  temperature,  but  we 
have  tested  the  increase  in  temperature  by  monitoring  the 
variation  in  the  potential  energy  with  the  MC  cycle  number 
during  the  heating  process.  Figure  l  depicts  this  variation  in 
the  potential  energy  per  atom  with  7  given  by  Eq,  (11)  in  a 
simulation  corresponding  to  Tif~  1500  K  and  A  7 
=0.001  K /cycle.  Each  point  in  the  figure  has  been  obtained 


FIG,  1 .  Variation  of  potential  energy  per  atom  as  a  function  oF 
temperature  T  given  by  Eq.  (11).  The  straight  line  shows  the  least 
squares  fit, 

by  averaging  the  potential  energy  over  1000  cycles.  The 
straight  line  in  the  figure  corresponds  to  the  least-squares  fit. 
As  expected,  we  note  that  the  slope  of  the  straight  line  is 
equal  to  |  ks  that  shows  that  the  system  is  being  heated 
according  to  the  temperature  given  by  Eq,  (11). 

1*  Order  parameter 

The  melting  of  the  system  has  been  identified  by  monitor¬ 
ing  the  energy  and  the  order  parameter  (f)26^J  as  a  function 
of  temperature  during  the  heating  process.  At  the  melting 
temperature,  these  parameters  change  abruptly.  Since  the  dia¬ 
mond  structure  can  be  considered  as  a  combination  of  two 
fee  structures,  we  can  write  £  as  an  average  of  the  order 
parameters  of  the  two  structures,  i,e,, 

i  2 

02) 


where 


fr-0/tfc) 


Nc 


2 


expO'k  -  Tj) 


(13) 


In  these  equations  Nc=Nf  2,  ij  is  the  position  vector  of  the 
jth  atom  of  (th  fee  structure  and  k  is  the  reciprocal  lattice 
vector.  For  the  fee  lattice,26  40 


k  =  (2ir/tU(-U,-l),  (14) 

where  a{,  is  the  length  of  the  unit  cell.  It  is  trivial  to  see  that 
for  an  ordered  crystal  £— *  1  and  £-*0  for  the  liquid  state. 


C.  MD  simulations 

We  employed  a  larger  system,  a  5X5X5  supcrcell  of  Si 
having  1000  atoms,  to  compute  density,  configurational  en¬ 
ergy.  specific  heat,  coordination  number,  and  radial  distribu¬ 
tion  function.  Such  a  system  has  been  simulated  using 
isothcrmal-isobaric  molecular  dynamics  (NPT-MD)  method. 
The  Mclchionna  modification41  of  the  Nosd-Hoovcr  equa¬ 
tions  of  motion  was  used  to  achieve  the  constant  temperature 
and  pressure.  The  thermostat,  as  well  as  baroslat  relaxation 
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FIG,  2,  Variation  of  order  parameter  (f)  and  potential  energy  per 
aiom  ( V)  as  functions  of  Monte  Carlo  simulation  cycle  number  (n) 
for  the  Tersoff-4  potential  with  initial  temperature  (7J=2950  K, 
void  size  (A/V()iJ)  =  0,  and  i»o=10X  103  cycles. 

time,  has  been  chosen  as  50.0  t.u,  (1  tu.-l.0l8x  I0’u  s), 
and  the  Verlei  leapfrog  procedure40  (with  time  step  0.05  t.u.) 
has  been  used  to  integrate  the  equations  of  motion.  Com- 
puled  results  are  given  in  Figs.  7-9  and  Tables  111  and  V. 

III.  RESULTS 
A.  Melting 

Figure  2  shows  the  variation  of  order  parameter  f  and  the 
configurational  energy  per  atom  V  as  a  function  of  the  Monte 
Carlo  cycle  number  n  for  a  system  of  216  atoms  at  P 
- 1  atm  initially  at  temperature  T„=2950  K  with  no  voids. 
The  heating  according  to  Eq.  (11)  has  been  started  after  nn 
=  10000  cycles.  A  rapid  change  in  £  and  V  depicted  by  these 
curves  at  «  =  I07x  I03  cycles  corresponds  to  the  melting  at 
7=3047  K. 

Figure  3  gi  ves  the  variation  of  the  transition  temperature 
as  a  function  of  the  void  size.  Nvak |,  defined  by  the  number  of 
atoms  randomly  removed  from  the  crystal.  Multiple  points  at 
a  value  of  NvM  in  the  figure  demonstrate  the  effect  of  vary- 


FIG.  3.  Transition  tempera  lure  as  a  function  of  the  void  size 
(Af^ljj)  for  the  Tcrsoff-4  potential.  The  horizontal  tine  shows  the 
average  temperature  (2509  K)  in  the  plateau  region. 
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FIG.  4,  Same  as  Fig,  2.  but  for  different  values  of  void  size 
(NV0ld)  and  initial  temperature  (?„}:  (a)  jVi(*j=l0,  7„-2590  K;  (hi 
14,  r„=2430  K;  (cl  AUd=  16,  7>22Q0  K. 

ing  either  the  initial  phase  or  temperature  of  the  system.  This 
figure  shows  that  the  transition  temperature  decreases  as 
Nvo»i  increases,  A  horizontal  line  in  the  figure  has  been  drawn 
to  mark  the  average  temperature  in  the  plateau  region  corre¬ 
sponding  to  1 1  14,  The  average  temperature  in  this 

region  is  2509  ±44  K.  With  the  assumption  that  the  transition 
temperature  in  the  plateau  region  is  the  melting  temperature 
that  corresponds  to  the  thermodynamic  melting  point,  we 
obtain  2509  K  as  the  melting  point  given  by  this  void- 
nucleated  method,  which  is  in  good  agreement  with  the  com¬ 
puted  values.  2567  K  and  2584  K.  given  by  Yoo  et  at,14  us¬ 
ing  the  same  potential  and  the  method  of  coexisting  solid- 
liquid  phases  and  2547  ±22  K  given  by  Cook  and  Clancy,21 

A  few  typical  curves  showing  the  variation  of  order  pa¬ 
rameter  {£)  and  configurational  energy  (10  for  Nvoil)=  1 0.  14. 
and  16.  res£)ciively,  are  shown  in  Figs.  4(a}~4(c).  For  all 
these  curves.  n0=  10  000  cycles  and  Tri  values  are  2590, 
2430,  and  2200  K,  respectively,  as  indicated  in  the  figure.  A 
comparison  of  the  curves  corresponding  to  Afvo«]=  14  and  16 
shows  that  melting  point  decreases  from  2456  K  for  void 
size  jYVOid=  14  to  less  than  2200  K  for  void  size  16, 

For  the  sake  of  further  discussion;  following  the  previous 
studies,25  26  28  30  we  introduce  a  term,  "critical  void  size."  to 
refer  to  the  maximum  value  of  the  void  size  in  the  plateau 
region.  It  may  be  noted  that  the  critical  void  size  depends  on 
the  substance,  the  interaction  potential  function,  pressure, 
and  the  number  of  atoms  (N)  considered  in  the  simulation 
box.  For  example,  the  studies  of  Agrawal  et  aL2ty  on  the 
melting  of  Argon  reveal  that  at  a  given  pressure  {P 
=44.56  kbar)  the  critical  void  size  increases  from  50  to  120 
as  N  increases  from  864  to  2040. 

In  view  of  various  simulation  studies  on  Ar,  Ne.  CH^NOj. 
RDX,  TNAZ.  ammonium  nitrate,  and  ammonium  dinitra- 
mide,  it  is  not  surprising  to  note  that  the  value  of  the  melting 
point  of  Si  given  by  the  void-nucleated  method  employed 
here  agrees  with  the  previously  reported2124  melting  point 
values. 

However,  one  needs  to  make  a  rigorous  analysis  of  the 
appropriateness  and  limitations  of  this  method.  Due  to  the 
lack  of  understanding  of  the  mechanism  of  melting,  the  re¬ 
lationship  between  the  existence  of  the  above-mentioned  pla¬ 
teau  region  and  the  mechanism  of  melting  is  not  yet  known. 
There  have  been  various  studies  to  explore  the  mechanism  of 
melting.  The  Lindemann  criterion42  of  melting  and  the  Bom 
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TABLE  il  A  comparison  of  melting  temperatures  given  by  the 
modified  (Vv)  and  unmodified  (Vjy)  Tersoff  potential  parameters 
with  the  experiment 


TJ  K) 

Tm  (K) 

f=TJTs 

Tersoff-41 

3075  ±29 

2509  ±44 

0.82 

Tersoff- A  RKb 

2097  ±21 

171  lc 

Experiment 

1687d 

“Reference  4. 
hPrescn  t  mod  i  fication , 

C0bmincd  by  using  Eq.  (!)  and  /=  2509/ 3075. 

JReferenee  49, 

instability  criterion43  of  melting  have  been  of  importance  in 
explaining  the  mechanism  of  melting.  The  investigations  of 
Jin  et  alP  show  a  strong  correlation  between  the  Lindcmann 
and  the  Born  instability  criteria.  The  Lindcmann  criterion 
based  on  the  mechanical  approach  of  melting  has  gained 
more  attention  as  a  result  of  the  recent  investigations  of 
Burakovsky  el  al 44  regarding  the  melting  of  elements  based 
on  the  dislocation  mechanism.44-47  Considering  melting  as  a 
dislocation-mediated  phase  transition,  Burakovsky  et  alAA 
computed  melting  curves  for  24  elements  in  good  agreement 
with  the  available  data.  They  also  note  that  their  dislocation- 
based  melting  approach  leads  to  melting  points  that  are  very 
close  to  those  given  by  the  Linde  matin  criterion.  Gomez  et 
a/,47  showed  that  defects  which  occur  in  the  solid  phase  as 
the  transition  temperature  is  approached  are  responsible  for 
melting.  However*  we  are  still  in  the  learning  phase  in  the 
area  of  the  mechanism  of  melting.  One  would  like  to  rigor¬ 
ously  learn  if  the  existence  of  the  plateau  region  in  the  void- 
size-versus-meking-tempcraiure  curve  is  universaJ  and 
whether  the  melting  point  in  the  plateau  region  corresponds 
to  the  thermodynamic  melting  point. 


B,  Superheating  and  /  factor 

The  extent  of  superheating  of  a  crystal  can  be  measured 
by  the  ratio  /  given  by  Eq.  (1),  Table  11  gives  the  value  of  the 
melting  point  without  any  void,  7j-3075  Kt  determined  by 
averaging  over  four  values  of  the  computed  melting  points 
with  different  initial  conditions.  The  true  or  thermodynamic 
melting  temperature  Tm- 2509  K.  given  by  the  average  of  the 
melting  temperatures  in  the  plateau  region*  as  discussed  in 
Sec.  Ill  A*  leads  to  /=0.82. 

It  is  interesting  to  note  that  this  value  of  /= 0.S2  for  a 
covalent  Si  crystal  is  comparable  to  that  determined  by  the 
homogeneous  nuclcation  model  of  Lu  and  Li33  for  Al*  Au, 
Co,  Cu,  Fe*  Mn.  Ni,  Pb*  Pd*  Pt*  Sb*  and  Sn,  and  those  com¬ 
puted  for  Ar*  Nc,  Cu,  CHjNCL,  RDX,  TNAZ*  ammonium 
nitrate,  and  ammonium  dinitramide.  For  all  these  crystals  /  is 
found  to  be  in  the  range  0.77  to  0.85. 

A  near  constant  value  of  /  for  a  large  number  of  systems 
suggests  the  possibility  of  /^0.80  for  many  other  sub¬ 
stances.  Thus*  the  prediction  of  /  in  this  range  may  be  an 
important  criterion  to  judge  the  validity  of  a  theoretical 
model.  Further*  the  rigorous  determination  of  /by  some  the- 
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oretical  model  would  also  make  the  determination  of  Tm 
easy. 

As  regards  the  effect  of  the  size  of  the  simulation  box  on 
the  factor  /that  signifies  the  degree  of  superheating,  Agrawal 
et  alP  have  found  that  the  value  of  /  for  a  system  of  argon 
crystals  does  not  change  when  the  periodic  system  contain¬ 
ing  864  atoms  is  replaced  by  that  having  2048  atoms.  A 
simulation  of  6912  atoms  of  Ar  by  Jin  et  al11  also  gave 
almost  the  same  value  of  /*  The  agreement  between  the  com¬ 
puted  values  of  /  and  those  given  by  Lu  and  Li35  as  dis¬ 
cussed  in  the  above  paragraphs  also  suggest  the  possibility  of 
insignificant  change  in  the  value  of  /  as  the  number  of  atoms 
in  the  periodic  system  considered  is  increased  by  a  large 
extent.  Also*  based  on  the  Lindcmann  criterion,42  namely,  a 
solid  melts  when  the  amplitude  of  atomic  vibrations  exceeds 
a  certain  fraction  of  the  lattice  spacing*  it  appears  that  the 
melting  temperature  or  superheating  of  an  ideal  crystal  with 
the  periodic  boundary  conditions  would  not  change  when  the 
system  size  is  increased  although  the  limits  of  this  generali¬ 
zation  have  not  been  established. 

C.  Modification  in  the  potential 
/.  Addition  of  long-range  interactions 

In  the  Tersoff  potential*  /c(r^)  occurring  in  Eq.  (3)  is  as¬ 
sumed  to  be  zero  for  r,;>3.0A  for  a  silicon  crystal.  To 
examine  the  effect  of  long-range  interaction,  we  have  added 
a  Lennard-Jones  ( 1 2-6)  interaction, 

Vu(rl})  =  4e[(cr/r,j)n  -  (<r/r0)6],  (15) 

10  (he  Tersoff  potential  Vtv  and  then  determined  the  melting 
point.  The  values  of  parameters,  *r=  3.826  A  and  e 
=0.0 1 7  44  eV  have  been  taken  from  Refs.  48  and  49.  To 
avoid  strong  repulsions  due  to  this  term,  we  considered  this 
interaction  only  when  r(7>4,0  A, 

By  running  a  Monte  Carlo  simulation  with  the  addition  of 
such  a  long-range  interaction,  we  found  that  melting  occurs 
at  31 16  K  when  void  size^O.  If  we  compare  this  value  of  Ts 
with  7^=3075  ±29  K,  given  by  the  Tersoff-4  potential  alone, 
we  infer  that  the  effect  of  the  addition  of  van  der  Waals 
interactions  in  the  Tersoff  potential  on  the  melting  is  very 
small.  Therefore*  we  need  to  look  for  some  more  significant 
modification  to  obtain  a  melting  point  in  agreement  with  the 
experiment. 

2,  Choice  of  new  parameters 

After  running  a  few  sets  of  simulations  by  varying  the 
magnitude  of  3  of  the  12  parameters  in  the  Tersoff  potential, 
it  has  been  observed  that  a  combination  of  0,  n*  and  h  pa¬ 
rameters  occurring  in  Eqs.  (4)  and  (6)  would  give  values  of 
the  melting  point  and  density  of  liquid  Si  in  agreement  with 
experiment  without  introducing  appreciable  change  in  the 
density  and  configurational  energy  of  a  solid  Si  crystal  at  low 
or  room  temperatures.  We  have  not  attempted  any  change  in 
parameters  A,  B ,  A*  /i,  and  ^  as  the  properties  of  the  solid 
even  at  low  temperatures  may  be  very  sensitive  to  these  pa¬ 
rameters,  Similarly,  we  did  not  investigate  the  effect  of  vary¬ 
ing  /?,  *5*  c*  and  d  as  these  parameters  may  not  be  as  effcc- 
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FIG.  5.  Variation  of  order  parameter  (£)  and  potential  energy  per 
atom  (V)  as  functions  of  Monte  Carlo  simulation  cycle  number  (n) 
for  Tersoff-ARK  potential  with  initial  temperature  (rj  =  2000  K, 
void  size  (jV*0 m)=0,  and  /r„=lOX  103  cycles. 

Live,  in  modifying  the  melting  point  as  well  as  the  density  of 
liquid  without  introducing  appreciable  changes  in  the  density 
and  cohesive  energy  of  the  solid  at  low  or  room  tempera¬ 
tures,  as  0,  n.  and  h.  To  achieve  this,  we  first  tested  various 
combinations  of  n  and  h  in  a  coarse  grid  without  changing 
the  value  of  0.  The  values  of  n  and  h  ranging  from  —0.75  to 
L5  times  their  original  values  have  been  attempted  to  repro¬ 
duce  the  melting  point.  A  finer  grid  of  0 ,  n,  and  h  is  then 
explored  to  reproduce  the  melting  temperature,  liquid  den¬ 
sity.  and  cohesive  energy  of  the  solid.  The  values  of  0 ,  n,  and 
h  so  determined  arc  1.15  x  1G-6,  0.988,  and  -0.745  25.  re¬ 
spectively  (see  Table  I).  For  further  discussion,  let  us  denote 
ihcTcrsoff  potential  with  these  modified  parameters  as  VV  or 
Tersoff-ARK  (Agrawal-Raff-Komanduri)  potential.  The  re¬ 
sults  obtained  with  Tersoff-ARK  are  discussed  in  Secs, 
III  C2*-II1  C2e, 

a.  Melting  point ,  Similar  to  the  results  shown  in  Fig.  2  for 
the  Tersoff-4  potential,  wc  get  Fig.  5  for  the  Tersoff-ARK 
potential.  In  this  simulation  T„=2000  K,  n*=  10000  and, 
melting  occurs  at  n  =  79  000  that  correspond  to  melting  at 
2069  K  for  a  crystal  without  any  void.  Figures  6(a)-6{i) 
show  the  configurations  of  the  system  of  216  silicon  atoms  at 
various  stages  of  this  simulation.  Figures  6(a)-6(c)  depict  the 
crystalline  phase.  Figure  6(d)  corresponds  to  /i=79  000  when 
the  crystal  starts  melting  and  the  order  parameter  (£}  just 
begins  to  drop  (see  Fig.  5  for  u  versus  £  curve  for  this  simu¬ 
lation).  Figures  6(e)  and  6(f)  exhibit  the  configurations  at  n 
-85  000  and  90  000,  respectively;  these  belong  to  the  region 
of  the  rapid  drop  in  the  order  parameter  (see  Fig.  5).  Figures 
6(g)_6(i)  represent  configurations  at  n  =  95  000,  100  000,  and 
105  000,  respectively:  this  region  corresponds  to  the  liquid 
characterized  by  low  values  of  the  order  parameter  (see  Fig. 
5), 

By  averaging  the  results  on  a  set  of  four  such  simulations 
with  different  initial  conditions,  wc  get  the  mechanical  melt¬ 
ing  temperature  as  r,-2097±21.  Using  Eq,  (!)  and  assum¬ 
ing  the  value  of  f=Tm fTs =2509/ 3075,  as  described  in  Sec. 
HI  B,  we  obtain  the  melting  temperature,  7^=  1711  K  (see 
Tabic  11),  which  is  in  good  agreement  with  the  experimental 
value  1687  K4'> 
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FIG,  6.  The  configuration  of  silicon  atoms  at  various  stages  of  a 
MC  simulation  at  n  =  ( a)  1000,  (b)  401X10,  (c)  75  000.  (d)  79  000, 
(e)  85  000,  (0  90  000,  (g)  95  000,  fh)  100000,  and  fi>  105  000  for 
Tersoff-ARK  potential  with  initial  temperature  {7^) =2000  K,  void 
size  (AVoid)  =  0.  and  n0- 10  X  [Q3  cycles. 

b.  Density.  Figure  7  compares  the  density  of  a  solid  and  a 
liquid  for  the  Tersoff-ARK  (solid  curve)  and  the  Tcrsoff-4 
(dotted  curve)  potentials.  We  note  that  at  low  temperatures, 
the  densities  of  the  solids  given  by  V|V  and  Vv  are  in  excel¬ 
lent  agreement.  However,  the  density  values  predicted  by  Vv 
are  found  to  be  larger  than  those  given  by  F|V  at  higher 
temperatures.  The  difference  increases  with  temperature.  At 
the  melting  point,  this  difference  is  about  1%,  For  the  liquid, 
wc  see  a  large  difference  in  the  densities  predicted  by  these 
two  potentials.  At  1687  K.  the  density  of  liquid  predicted  by 
Vv  is  found  to  be  2,589  ±0,0 1 3  g  cm‘\  which  is  in  excellent 
agreement  with  the  experimental  density  2.583  g  cnr5.50 

however  gives  a  density  of  2.2?5  g  enr3  for  the  super¬ 
cooled  liquid  at  this  temperature.  It  may  be  noted  that  for  the 
Tersoff-ARK  potential  at  1687  K,  the  ratio,  AV/V,,  of 
change  in  volume  An  to  the  volume  of  solid  Va  at  the  phase 
transition  is  equal  to  -11.8%,  which  is  in  excellent  agree¬ 
ment  with  the  experimental  value  -11.9%, 51 


Temperature  (K) 


FIG.  7.  The  variations  in  densities  of  solid  <•)  anti  liquid  (A)  as 
a  function  of  temperature:  solid  curves  for  Tersoff-ARK  poicntial 
and  doited  curves  for  the  Terosoff-4  potemial. 
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FIG,  8.  (a)  The  difference  in  energy  A  as  defined  by  Eq.  (16); 
(b)  the  value  of  ihc  configurational  energy  per  atom,  as  a  function  of 
density  for  silicon  in  the  ideal  tetrahedral  configuration  at  7=0  K 

c.  Energy.  We  have  compared  configurational  energy  of  Si 
in  the  solid  phase  in  two  ways:  (i)  At  7=0  for  different 
values  of  pressure  or  density,  and  (it)  at  P- 0  for  different 
values  of  tempera  lures.  In  Fig.  8(a),  we  have  shown  the  dif¬ 
ference  in  ihe  two  potentials  by  a  parameter  A  defined  as 

A  =  I0,(Vv-Viv)/V'iv.  (16) 

where  Vv  and  Vrv  are  computed  for  an  ideal  tetrahedral  ge¬ 
ometry  at  7=0  at  different  values  of  pressure,  which  can  be 
specified  by  the  density  of  the  crystal.  Wc  note  that  for  A 
<0.25.  i.e..  the  two  potentials  differ  by  less  Lhan  0.000  25% 
over  a  wide  range  of  pressure,  at  7=0.  In  this  connection,  it 
may  also  be  interesting  to  see  the  behavior  and  absolute 
value  of  per  atom  in  this  configuration;  Fig,  8(b)  gives 
these  data. 

Figure  9  compares  the  configurational  energy  given  by  the 
iwo  potentials  at  various  temperatures.  It  shows  the  variation 
of  configurational  energy  per  atom  as  a  function  of  tempera¬ 
ture  for  Si  in  solid  and  liquid  phases  for  the  Tersoff-ARK 
(solid  curve)  and  the  Tersoff-4  (dotted  curve)  potentials  ai 
7=0.  In  the  case  of  the  solid,  the  results  of  and  Vy  differ 


Temperature  (K) 

FIG.  9.  The  configurational  energy  values  of  solid  (•}  and  liq¬ 
uid  (A)  as  functions  of  temperature:  solid  curves  for  the  Tersoff- 
ARK  potential  and  dotted  curves  for  the  Terosoff-4  potential. 
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by  less  than  0.001  cV  and,  therefore,  the  dotted  curve  corre¬ 
sponding  to  the  potential  Vlv  coincides  with  ihe  solid  curve. 
The  slope  of  the  kinctic-cnergy-vcrsus-tcmpcraturc  curve  for 
the  classical  model  is  trivially  T5  kB.  By  summing  the  slope 
of  the  potential  and  kinetic  energies,  wc  can  obtain  the  spe¬ 
cific  heat  Cp ,  It  is  interesting  to  note  that  in  this  classical 
simulation,  the  average  slope  of  the  potential  energy  curve 
for  the  solid  is  nearly  equal  to  that  given  by  the  kinetic 
energy  curve.  The  average  value  of  C„  in  the  temperature 
range  100-2000  K  for  the  Tersoff-ARK  potential  as  well  as 
the  Tersoff-4  potential  is  found  to  be  25.4  1  mol’1  K1,  which 
is  in  good  agreement  with  the  experimental  values52  at  high 
temperatures:  25.359,  27.196,  and  28,870  J  mol-1  K"1  at  7 
=  800,  1200,  and  1600  K,  respectively. 

As  shown  in  Fig,  9,  in  the  case  of  the  liquid,  we  note  a 
large  difference  between  the  configurational  energy  values 
given  by  the  two  models.  We  also  note  significant  variation 
in  the  slopes  with  temperature.  With  respect  to  the  average 
specific  heal  in  the  range  1687  to  2000  K.  we  obtain  C> 
=  31.0  J  mot"1  K-1  for  Vy,  that  given  by  in  the  tempera- 
lure  range  2500-3000  K  is  38.5  J  mol-3  K_1t  against  the  ex¬ 
perimental  value  of  29.2  J  moF1  K~l  at  7=1687  K  (sec 
Table  III). 

From  the  information  given  in  Fig.  9,  we  can  determine 
the  heat  of  fusion  as  19.3  kJ  mol'1  for  the  Tersoff-ARK  po¬ 
tential,  which  is  in  poor  agreement  with  the  experimental 
value,53  50,6  kJ  mol’1  (see  Table  III).  This  disagreement  sug¬ 
gests  the  need  for  further  improvement  in  the  potential . 

Using  MD  simulations,  the  configurational  energy  (V)  as 
well  as  the  density  for  a  liquid  and  a  solid  at  various  tem¬ 
peratures  have  also  been  computed  using  the  Tersoff-ARK 
potential  on  the  periodic  system  having  216  atoms  in  the 
simulation  box.  The  computed  values  so  obtained  are  found 
to  be  in  excellent  agreement  with  the  corresponding  values 
reported  in  Figs.  7  and  9  for  the  periodic  system  having  1000 
atoms.  Regarding  the  size  dependence  of  the  computed  melt¬ 
ing  point,  Miranda  and  Antonelli60  recently  reported  an 
agreement  between  the  values  of  ihc  melting  point  of  silicon 
given  by  the  periodic  systems  having  216  and  512  atoms  in 
the  simulation  box.  From  the  results  presented  here,  together 
with  the  other  results  available  in  the  literature,  it  appears 
that  the  values  of  the  melting  point,  density,  and  configura¬ 
tional  energy  determined  in  the  present  studies  are  size  inde¬ 
pendent,  at  least  in  the  range  of  atoms  considered:  although, 
the  limits  of  this  generalization  have  yet  to  be  established. 
Whether  or  not  the  degree  of  overheating  of  the  periodic 
system  with  an  extremely  large  number  of  atoms  (say  on  the 
order  of  a  billion  or  more)  would  change  due  to  the  change 
in  the  lattice  vibrations  remains  to  be  seen,  Admittedly,  simu¬ 
lations  of  such  large  systems  require  very  fast,  highly  pow¬ 
erful  computers  with  adequate  memory.  The  availability  of 
such  systems  (single  systems  or  with  parallel  processing  ca¬ 
pability)  in  the  not  too  distant  future  can  answer  this  question 
more  definitely. 

d.  Palm  defect  and  surface  energies.  In  Table  IV,  we  have 
reported  ihe  computed  values  of  the  point  defect  and  surface 
energy  values  given  by  the  Tersoff-4  and  Tersoff-ARK  po¬ 
tentials  for  vacancy,  hexagonal  interstitial,  and  tetrahedral 
interstitial  defects,  and  ( 1 00}  and  (III)  surfaces.  For  deter¬ 
mining  the  point  defect  energy  values,  we  have  considered  a 
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TABLE  111.  Comparison  of  various  potentials  for  Si  (ail  simulation  values  are  reported  ai  the  melting 
point  of  the  respective  potential  model). 


MEAM4 

swh 

Tersoff-4c 

Tersoff- ARKC 

Expt. 

Melting  point  (K) 

1475 

1691 

2509 

1 7 1 1 

1687d 

Density  of  liquid 
( g  mL 1 ) 

2.302 

2.459 

2.225 

2.589 

2.583' 

Heat  of  fusion  at 
melting  (kJ  mol-1) 

35,0 

30*9 

39.7 

193 

50, 6h 

Cp  of  liquid 
(J  mol"1  K~{) 

32.3 

35*3 

—38.5 

—31*0 

29.2h 

Liquid  coordination 
number 

6*0-8. 2 

5. 5-6*2 

-4.7 

-5*4 

-6.4b 

aData  taken  from  Ref.  23. 
bData  taken  from  Ref.  53. 
'Present  work. 


dDala  taken  from  Ref.  49, 
cData  taken  from  Ref.  50. 


system  of  216  atoms  with  the  periodic  boundary  conditions 
and  have  added  or  subtracted  one  atom  to  make  it  an  inter¬ 
stitial  defect  or  vacancy.  The  system  is  allowed  to  relax  be¬ 
fore  computing  the  energy.  For  a  comparison,  the  point  de¬ 
fect  energy  values  computed  by  Tersoff1  on  the  Tersoff- 3 
potential  and  the  recent  density  functional  theory  (DFT)  re¬ 
sults  given  by  Gocdecker  et  al and  Leung  et  ai 55  are  also 
listed  in  the  lahle. 

From  the  data  listed  in  Table  IV.  we  note  that  the  values 
of  the  vacancy  energy  given  by  Tersoff-3,  Tersoff-4*  and 
Tersoff- ARK  are  equal  and  are  comparable  to  that  obtained 
by  Gocdecker  et  al 54  under  the  local  density  approximation 
(LDA),  We  also  note  that  the  tetrahedral  interstitial  energy 
value  given  by  the  Tersoff-4  potential  is  -'14%  higher  and 
that  by  the  Tersoff- ARK  potential  is  -^36%  lower  than  the 
corresponding  LDA  value  determined  by  Leung  et  al 55  As 
regards  the  hexagonal  interstitial  energy,  ihe  value  given  by 
the  Tersoff-4  potential  is  —39%  higher  and  that  by  the 


TABLE  IV.  Point  defect  and  surface  energy  values. 


Tersoff-3 

Tersoff-4 

Tersoff-ARK 

Other 

Point  defect  energy  (eV) 

Vacancy  3  *7 4 

3.7 

3.7 

3.1 7,b  3.56c 

Interstitial  3,84 

3.9 

2.2 

3.43d 

(tetrahedral) 

Interstitial  4.7* 

(hex  agonal) 

4.6 

2.8 

3.3  lw 
2.87' 

Surface  energy  (eV/A2) 

Si(IOO)  0.144' 

0.144 

0.142 

SK  ill) 

0.080 

0.078 

“Reference  3. 

hRe  fere  race  54:  DFT  calculation  using  general  gradient  approxiroa- 
lion  (GGA), 

^Reference  54;  DPT  calculation  using  LDA. 

^Reference  55, 

'Reference  39. 


Tersoff- ARK  potential  is  —15%  lower  than  the  correspond¬ 
ing  LDA  value  given  by  Leung  et  ai  It  may  also  be  noted 
that  the  value  of  the  hexagonal  interstitial  energy  given  by 
the  Tersoff-ARK  potential  is  in  excellent  agreement  with  the 
corresponding  LDA  value  determined  by  Gocdecker  et  ai54 

To  compare  the  surface  energy  values  with  the  present 
results  for  the  Tersoff-4  and  Tersoff-ARK  potentials,  wrc  have 
included  the  results  of  Balamane  et  a!*9  for  the  Tersoff-3 
potential'  for  Si(lOO)  and  Si(lll)  surfaces  in  Table  IV.  We 
note  that  the  surface  energy  values  given  by  the  Tersoff-4 
potential  are  equal  to  the  corresponding  values  given  by  the 
Tersoff-3  potential,  and  the  values  given  by  the  Tersoff-ARK 
potential  arc  —2%  lower.  It  may  be  noted  that  all  six  values 
listed  in  the  table  for  surface  energy  correspond  to  the  ideal 
un  re  I  axed  configuration. 

The  agreement  between  the  Tersoff-4  and  Tersoff-ARK 
potentials  in  predicting  surface  energy  as  well  as  vacancy 
energy,  as  listed  in  Table  IV,  again  shows  that  the  Tersoff-4 
and  Tersoff-ARK  potentials  give  nearly  the  same  results 
when  the  deformation  in  the  bond  angles  and  bond  lengths  is 
small.  The  higher  values  of  hexagonal  interstitial  energy  as 
well  as  tetrahedral  interstitial  energy  given  by  the  Tersoff-4 
potential  show  that  as  compared  to  the  Tersoff-ARK  poten¬ 
tial  the  Tersoff-4  potential  makes  the  deformation  more  dif¬ 
ficult;  qualitatively,  this  observation  is  consistent  with  the 
fact  that  the  melting  point  given  by  the  Tersoff-4  potential  is 
higher  than  that  given  by  the  Tersoff-ARK  potential. 

e.  Other  points.  Figure  10  compares  the  radial  distribution 
function  (RDF)  of  liquid  Si  at  3000  K  given  by  Vy  (solid 
curve)  and  VjV  (dotted  curve).  Table  V  compares  the  loca¬ 
tions  of  the  first  and  next  peaks,  the  height  of  the  first  peak, 
and  the  full  width  at  half  maximum  (FWHM)  of  the  first 
peak.  For  comparison,  the  values  given  in  Ref,  3  for  Vm  and 
those  given  by  the  experiment56*57  are  also  reported  in  the 
table.  We  note  that  there  is  a  good  agreement  among  all  four 
sets  of  numbers. 

The  coordination  number  for  the  solid  crystal,  given  by 
V|V  as  well  as  Vv*  is  found  to  be  4.0,  which  is  in  agreement 
wiLh  the  tetrahedral  structure.  For  the  liquid,  however,  we 
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FIG,  10.  The  RDF  for  liquid  Si  at  3000  K;  solid  curve  for  the 
TcrsofFARK  potential  and  dotted  curve  for  the  Tersoff-4  potential. 

note  that  the  coordination  number  -5.4,  given  by  Vw,  which 
is  higher  than  the  value  -“4.7  given  by  Vtv  but  is  lower  than 
the  experimental  value  -6.4  (see  Table  III). 

Keblinski  et  at**  remark  that  with  the  exception  of  the 
Stillinger  and  Weber  (SW)  potential5  and  a  similar  potential 
developed  by  Mistriotis  et  ai?x  among  more  than  30  empiri¬ 
cal  potentials  for  silicon,  no  potential  other  than  the 
environmental-dependem  interatomic  potential  (EDIP)  de¬ 
veloped  by  Justo  et  at. lit  predicts  a  reasonable  melting  point. 
Recently,  there  has  been  a  number  of  studies  on  the  compu¬ 
tation  of  the  melting  point  of  Si  using  the  potential  of  Justo 
et  at.  The  melting  point  given  by  this  potential  is  found  to  be 
1572  K  by  Kaczmarski  et  at,**  1582  ±25  K  by  Miranda  and 
Antoroelli,60  and  1520±30  K  by  Keblinski  et  al .,5S  all  within 
10%  of  the  experimental  value,  1687  K,  For  comparison,  ihe 
value  of  the  melting  point  given  by  the  Tersoff-ARK  poten¬ 
tial  is  171 1  K,  With  respect  to  the  heat  of  fusion  and  density, 
the  EDIP  potential  gives59  a  heat  of  fusion  equal  to 
36,0  kJ  mol'1  and,  contrary  to  the  experiment,  it  predicts59  a 
decrease  in  density  of  Si  as  it  undergoes  the  phase  transition 
from  the  solid  phase  to  the  liquid  phase. 

It  would  be  interesting  to  compare  various  properties  of 
liquid  silicon  as  predicted  by  the  Tcrsoff-4,J  Tersoff-ARK, 
modi  tied -cm  bedded -atom -method  (MEAM)tH,2i  and  Still¬ 
inger  and  Weber  (SW)  potentials5  (see  Table  til  for  details). 
First,  we  note  that  none  of  these  models,  including  Tersoff- 
ARK.  provides  very  good  agreement  of  all  the  properties 


TABLE  V.  A  comparison  of  RDF  results  given  by  different  po¬ 
tentials  with  the  experiment. 


V 

V|vb 

Vm c 

Experiment 

lineal  ion  of  the  first  peak  (A) 

2,48 

2.44 

2,45 

2.5,“  2.41-2.46' 

Location  or  the  next  peak  (A) 

3M 

3.95 

3,90 

3.8.d  3.28-3.45' 

FWHM  of  first  peak  (A) 

048 

0.48 

0.52 

0.6d 

Height  of  (he  first  peak 

2.55 

2,50 

2.4 

2.5“ 

“Present  modification. 

TcrsofM  potential  as  given  in  Ref.  4. 


‘Reference  3. 
^Reference  56. 
'Reference  57 
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listed  with  the  experimental  values.  Second,  compared  to  the 
Tersoff-4  potential,  Tersoff-ARK  yields  better  agreement 
with  experiment  of  the  melting  point,  density  of  the  liquid, 
and  specific  heat  of  the  liquid.  Third,  presently,  Tersoff- 
ARK’s  shortcomings  are  the  prediction  of  lower  values  of 
heat  of  fusion  and  liquid  coordination  number.  Both  MEAM 
and  SW  models  give  better  agreement  of  the  heat  of  fusion 
and  liquid  coordination  number  with  the  experiment  com¬ 
pared  to  Tersoff-ARK,  but  at  the  expense  of  the  density  of 
liquid  (and  the  melting  point  in  ihc  case  of  MEAM).  Also, 
MEAM  and  SW  models  predict  specific  heat  of  the  liquid 
nearly  10%  and  20%  higher  than  the  experiment,  respec¬ 
tively.  With  respect  to  the  solid,  the  computed  results  are 
expected  to  be  nearly  the  same  for  the  Tersoff-4  and  Tersoff- 
ARK  potentials  when  the  temperature  ts  not  high  and  the 
structure  remains  tetrahedral.  Future  studies  will  focus  on 
further  modification  of  the  parameters  in  the  Tersoff-ARK 
potential,  such  that  these  two  properties  as  well  as  other 
properties,  such  as  elastic  constants  and  phonon  frequencies, 
agree  with  the  values  reported  in  the  literature.  This  may 
involve  modification  of  the  numerical  values  of  some  of  the 
parameters  in  the  TersofTs  potential  and/or  functional  modi¬ 
fication  of  the  parameters  such  that  a  majority  of  experimen¬ 
tal  values  agree  with  predictions. 

IV,  SUMMARY  AND  CONCLUSIONS 

The  void -nucleated  melting  method  has  been  employed  to 
compute  the  melting  point  of  Si  using  the  Tersoff-4  potential 
The  good  agreement  of  the  computed  melting  point  to  that 
determined  by  Yoo  et  at24  by  the  method  of  equilibration  of 
coexisting  and  liquid  phases  using  the  same  potential 
supports  the  appropriateness  of  the  void- nucleated  method, 
which  is  at  present  empirical. 

The  computed  value  of  the  ratio  of  the  melting  point  and 
mechanical  melting  point  /,  given  by  Eq,  (1),  is  found  to  be 
0,82,  This  value  of  /  is  comparable  to  those  computed  for 
van  dcr  Waals  and  molecular  crystals  of  Ar,  Ne,  CHjNCK 
RDX,  TNAZ,  ammonium  nitrate,  and  ammonium  dinitra- 
mide  as  well  as  those  given  by  the  theoretical  model  of  Lu 
and  Li53  for  a  large  number  of  metallic  crystals  of  Al,  Au, 
Co,  Cu,  Fe,  Mn,  Ni,  Pb,  Pd.  Pt,  Sb,  and  Sn;  for  all  these 
crystals,  /  is  found  to  be  in  the  range  0.77  to  0,85. 

The  validity  of  the  void -nucleated  melting  method  and  the 
fact  that  /— 0.8  for  a  large  number  of  systems  suggests  the 
need  for  a  more  rigorous  investigation  on  whether  the  exis¬ 
tence  of  the  plateau  region  in  the  void  size  versus  melting 
temperature  is  universal,  whether  the  melting  point  in  the 
plateau  region  corresponds  to  the  Thermodynamic  melting 
point,  and  whether  ihe  value  of  /— 0.8  is  also  a  universal 
result. 

In  agreement  with  the  previous  computed  results,  the 
present  studies  show  that  the  Tersoff-4  potential  overesti¬ 
mates  the  melting  point  by  more  than  50%  and  gives  the 
density  of  the  liquid  -“14%  Lower  than  the  experiment.  It  is 
also  noted  that  an  addition  of  a  long-range  Lennard -Jones 
(12-6)  interaction  to  the  Terosoff-4  potential  does  not  signifi¬ 
cantly  change  the  melting  point. 

The  present  studies  also  show  that  a  simple  modification 
in  3  of  the  12  parameters  of  the  Tersoff-4  potential  can  bring 
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ihc  melting  point  as  well  as  the  density  of  the  liquid  into 
good  agreement  with  experiment  without  altering  the  density 
and  energy  values  of  the  solid  crystal  at  room  temperature. 

A  comparison  of  density  over  a  wide  range  of  temper 
uires  shows  that  the  density  of  the  solid  predicted  by  the 
Tersoff- ARK  potential  (Tersoff  potential  with  the  modified 
parameters)  is  larger  than  that  given  by  the  Tersoff-4  poten¬ 
tial.  This  difference  is  not  appreciable  at  room  temperature, 
hut  increases  with  temperature  and  is  —  1%  at  the  melting 
point.  The  computed  cohesive  energy  of  the  solid,  the  point 
defect  energy  corresponding  to  a  vacancy,  and  the  surface 
energy  values  for  ( 1 00)  and  (111)  surfaces  arc  also  found  to 
be  almost  equal  for  the  modified  as  well  as  unmodified  pa¬ 
rameters.  The  point  defect  energy  values  corresponding  to 
the  hexagonal  and  tetrahedral  interstitials,  however,  are 
found  to  be  —40%  lower  when  the  modified  parameters  are 
used.  The  hexagonal  interstitial  energy  given  by  the  modified 
parameters  is  found  to  be  in  excellent  agreement  with  the 
corresponding  LDA  value  determined  by  Goedecker  el  ai 54 
The  coordination  number  of  the  liquid  and  specific  heat  arc 
also  found  to  be  in  better  agreement  with  experiment  when 
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the  modified  set  of  parameters  is  used.  It  may  be  of  interest 
to  investigate  further  to  see  if  the  Tersoff- ARK  potential  is  a 
better  choice  for  other  properties,  such  as  elastic  constants 
and  phonon  frequencies. 
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The  neural  network  (NN)  procedure  to  interpolate  ab  initio  data  for  the  purpose  of  molecular 
dynamics  (MD)  simulations  has  been  tested  on  the  Si02  system*  Unlike  other  similar  NN  studies, 
here,  wc  studied  the  dissociation  of  SiO?  without  the  initial  use  of  any  empirical  potential.  During 
the  dissociation  of  Si02  into  Si  +  O  or  Si  +  02,  the  spin  multiplicity  of  the  system  changes  from 
singlet  to  triplet  in  the  first  reaction  and  from  singlet  to  pentet  in  the  second*  This  paper  employs 
four  potential  surfaces.  The  first  is  a  NN  fit  [NN(STP)]  to  a  database  comprising  the  lowest  of  the 
singlet,  triplet,  and  pentet  energies  obtained  from  density  functional  calculations  in  6673  nuclear 
configurations.  The  other  three  potential  surfaces  are  obtained  from  NN  fits  to  the  singlet,  triplet, 
and  peniet-state  energies.  The  dissociation  dynamics  on  the  singlet-state  and  NN(STP)  surfaces  are 
reported.  The  results  obtained  using  the  singlet  surface  correspond  to  those  expected  if  the  reaction 
were  to  occur  adiabaticaily.  The  dynamics  on  the  NN(STP)  surface  represent  those  expected  if  the 
reaction  follows  a  minimum-energy  pathway.  This  study  on  a  small  system  demonstrates  the 
application  of  NNs  for  MD  studies  using  ab  initio  data  when  the  spin  multiplicity  of  the  system 
changes  during  the  dissociation  process.  ©  2(X)6  American  Institute  of  Physics . 

[DOl:  10. 1063/ L2 185638] 


I,  INTRODUCTION 

The  interpolation  of  ab  initio  potential  energy  surface 
(FES)  data  using  neural  networks1"5  (NNs)  and  other 
methods'1-24  for  the  purpose  of  executing  molecular  dynam¬ 
ics  simulations  has  been  a  subject  of  considerable  interest. 
Recently,  Raff  et  ai 1  have  investigated  in  detail  a  method 
involving  the  use  of  NNs  to  affect  the  interpolation  of  ab 
initio  PES  data.  The  results  of  such  studies1  performed  for 
Si5  and  vinyl  bromide  systems  show  that  the  NN  method  is 
very  promising  in  terms  of  accuracy,  convenience,  and  the 
requirement  of  CPU  time.  Our  recent  study2  on  the  vinyl 
bromide  dissociation  dynamics  demonstrates  the  success  of 
the  NN  method  for  molecular  dynamics  (MD)  simulations  of 
a  six-atom  system  having  six  open  reaction  channels* 

The  vinyl  bromide  system  so  investigated  is  somewhat 
simple,  at  least  in  the  sense  that  the  electronic  state  of  the 
system  for  various  dissociation  channels  studied  does  not 
change  during  the  dissociation  process.  That  is,  the  reactions 
arc  all  expected  to  occur  adiahatically  on  the  ground-state 
singlet  potential  surface.  However,  it  is  well  known  that 
many  chemical  reactions  for  various  systems  proceed  in  such 
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a  manner  that  the  system  changes  its  spin  state  during  and 
after  the  reaction.  For  example,  for  the  dissociation, 

SiO*-*  SiO +  G,  (I) 

the  system  is  initially  in  the  singlet  stale  but  it  is  in  the  triplet 
state  after  the  dissociation.  Similarly,  the  dissociation  given 
by  the  following  reaction: 

Si02“«  Si  +  02,  (2) 

corresponds  to  an  example  in  which  the  system  is  initially  in 
the  singlet  state,  whereas  the  products  are  in  the  pentet  state. 
Therefore,  in  pursuit  of  studying  the  applicability  of  the  NN 
procedure1”5  to  other  systems,  it  may  be  interesting  to  study 
the  reactions  given  by  Eqs,  ( 1 )  and  (2), 

One  special  feature  of  our  previous  study  on  vinyl  bro¬ 
mide  is  the  application  of  a  method  based  on  ihc  use  of  an 
empirical  potential  energy  surface  to  sample  the  configura¬ 
tions  of  the  molecule  in  a  12-dimcnsional  hypcrspacc.  There, 
we  generate  configurations  of  vinyl  bromide  by  running  a 
large  number  of  dissociating  trajectories  using  an  empirical 
potential.  Out  of  the  configurations  so  generated,  a  few  thou¬ 
sands  are  first  selected.  These  configurations  are  then  used  to 
compute  an  ah  initio  potential  database  which  is  employed  to 
generate  a  NN  potential  surface  on  which  additional  irajec- 

©  2006  American  Institute  of  Physics 


124,  134306-1 


Downloaded  19  Jul  2006  to  139JB.79 .49,  Redistribution  subjocl  to  AlP  license  or  copyright,  see  http://|cp.aip,org/icp/copyright,jsp 


134306-2  Agrawai  er  a/. 


tories  arc  computed  to  generate  more  configurations  whose 
energies  are  then  calculated  to  increase  the  size  of  the  ab 
initio  database.  By  repeated  application  of  such  additions 
using  a  novelty  sampling  (NS)  technique*1"  we  obtain  a  suf¬ 
ficient  number  of  data  points  to  reach  a  converged  PES.  This 
(MD/NS/NN)  method  thus  selects  the  important  regions  of 
phase  space  that  are  involved  in  the  reactions  under  consid¬ 
eration*  These  techniques  reduce  the  requirement  of  comput¬ 
ing  ab  initio  data  at  millions  of  configurations  to  a  few  thou¬ 
sands,  However*  the  method  has  a  limitation  that  it  is  not 
self-starting.  One  needs  to  start  with  an  empirical  potential. 
Therefore*  it  is  of  interest  to  see  if  a  system  such  as  Si02 
with  only  three  internal  degrees  of  freedom  can  be  investi¬ 
gated  without  the  initial  use  of  an  empirical  potential 

For  the  Si02  system*  where  only  three-dimensional  hy- 
perspacc  corresponding  to  three  coordinates  and  R$  is 

needed*  one  can  attempt  to  investigate  the  reactions  by  sam¬ 
pling  configuration  space  using  chemical  intuition  and  a  nar¬ 
row  grid  of  points  within  an  appropriate  range  of  Rx,  R2,  and 
Ry.  Rx *  R2 *  and  R ^  refer  to  the  Si-Ocl\  Si-0{2)*  and 
q(|)_q(2)  distances,  respectively.  Here,  superscripts  (1)  and 
(2)  have  been  used  to  distinguish  the  two  oxygen  atoms. 

In  this  paper*  we  shall  present  the  results  of  such  studies 
of  the  Si02  system.  In  the  next  section  we  have  described  the 
computational  procedure.  The  results  arc  then  presented  and 
discussed  in  Sec.  111.  Section  IV  presents  a  discussion  of 
possible  adiabatic  versus  nonadiabatic  processes  for  this  sys¬ 
tem,  The  results  are  summarized  in  the  final  section* 

II  COMPUTATIONAL  PROCEDURE 
A*  Ab  initio  data 

Ab  initio  calculations  have  been  performed  using  Gauss¬ 
ian  03  (Ref.  25)  at  B3LYP/6-31G*  level  at  3753  configura¬ 
tions*  For  each  configuration*  three  Gaussian  calculations 
have  been  run  by  varying  the  spin  multiplicity  (singlet,  trip¬ 
let*,  and  pemet)*  The  3753  configurations  have  been  obtained 
from  a  non  uniform  grid  in  RrR2-8  space,  where  0  is  the 
angle  between  and  R2  vecLors.  The  grid  spacing  in  the 
or  R2  dimension  is  varied  from  0,01  to  1.0  A,  The  minimum 
spacing  value  is  used  near  the  equilibrium  value  of  the  Si-O 
bond  length  and  the  maximum  value  is  used  for  large  values 
of  R}  or  fti.  The  bond  distances  arc  varied  from  1  to  10  A 
and  8  is  varied  from  0°  to  180°.  In  the  8  dimension*  the 
spacing  is  varied  from  3°  to  10°. 

Configurations  that  arc  very  different  from  those  associ¬ 
ated  with  reactions  (l)  and  (2)  have  been  ignored.  For  ex¬ 
ample.  configurations  corresponding  lo  Si+O  +  O  have  not 
been  considered.  When  Rx  and  R2  are  large*  configurations 
having  /?^>2.5  A  have  been  ignored. 

To  obtain  the  final  singlet,  triplet*  and  pentet  databases* 
we  have  discarded  all  those  poinLs  which  have  potential 
higher  than  V0*  where  VQ  corresponds  to  the  potential  energy 
of  the  Si( triplet) +0( triplet)  +  0( triplet)  system.  For  conve¬ 
nience,  we  have  scaled  the  potential  values  taking  VQ  as 
zero.  Finally,  symmetry  considerations  were  employed  lo  in¬ 
crease  the  size  of  the  database.  That  is*  for  a  configuration 
with  R ,=a*  R2~h'  and  if  we  added  a  point 

having  V=Valv  for  the  configuration  with  Rx=b,  Rz-at  and 
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TABLE  I  The  maximum  and  minimum  values  of  inpul  and  output  psinirn - 
eters. 


K 

Km* 

^  nun 

R,  (A) 

12 

10.0 

R;  <A) 

1,2 

10,0 

R,  (A) 

0*942  26 

10.0 

V  (eV)  (STP  surfacel 

-It. 840  5 

-0.0169 

V  (eV)  (S  surface) 

-11.H4Q5 

-00169 

J?3=c.  A  fourth  database  has  been  obtained  by  comparing 
singlet*  triplet*  and  pentet  energies  at  each  configuration  and 
selecting  the  minimum  of  these  three. 


B*  Neural  networks 

As  mentioned  earlier*  we  have  N  data  having  Rx,  R2y  and 
as  inputs  and  the  ab  initio  potential*  V(ab  initio },  as  out¬ 
put*  The  N  data  points  in  each  database  are  used  to  train  and 
test  a  two-layer  (3-40-1)  neural  network, 2<t  In  the  following 
discussion,  the  network  fits  to  the  singlet*  triplet,  pentet,  and 
combined*  minimum-energy  databases  are  denoted  as  NN(S)* 
NNfT),  NN(P),  and  NN(STP)*  respectively. 

For  training  the  neural  networks*  the  database  of  6673 
energies  was  partitioned  into  training  and  testing  sets  con¬ 
taining  5500  and  1 1 73  data  points*  respectively.  In  each  case* 
the  fitting  to  the  training  set  was  accomplished  iteratively 
using  3000  cycles  with  the  Lcvcnbcrg-Marquardt 
algorithm*27  To  obtain  the  best  fit*  we  chose  the  number  of 
neurons  («)  in  the  hidden  layer  to  be  40. 

From  such  training*  we  determined  W,,  bj,  W2*  and  b2 
matrices  of  dimensions  (n  X3).  (nX  1),  (l  X  n}*  and  ( I  X  1  )* 
respectively.  These  matrices  can  be  used~*  to  compute  the 
NN  potential  V^VforiNN)  for  any  desired  value  of  input 
matrix  £*=  or(/f|  .R2,R})  using  the  following  equations: 


V(a\  NN)  =  Wzaf  +  bj* 

(3) 

where 

a,  =  1/{I  +exp[-n,]}1 

(4) 

with 

nx  =  Wjfif+  bt. 

(5) 

Here,  n(l  a,*  and  a  represent  the  matrices  of  dimensions 
(n  x  1  )*  (nX  I)  and  (3  x  1)*  respectively.  The  elements  of 
matrix  a  are  the  input  parameters  tf3*  R2,  and  /?s. 

To  obtain  better  interpolation  accuracy,  wc  have  linearly 
scaled  the  input  and  output  parameters  in  the  range  of  -1  to 
1  using  the  following  relation: 

Ks  =  2{K  -  -  Km  J  -  ! ,  (6) 

where  Ks  is  ihc  sealed  value  of  K>  and  Kmn  and  Kmax  arc  the 
minimum  and  maximum  values  of  the  parameter.  After  com¬ 
puting  V'fo'jNN)  using  Eq.  (3),  wc  perform  the  reverse  scal¬ 
ing  procedure  to  obtain  the  actual  V(<*;NN).  The  values  of 
Kmin  and  for  all  parameters  are  given  in  Table  l 
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FIG,  I  The  potential  energy  in  eV  given  by  ihc  NN(STP)  neural  network  vs 
Ehe  corresponding  ah  initio  potential  energy  for  ihe  data  used  for  (a)  training 
and  (b)  testing. 

The  required  weight  and  bias  matrices  for  the  NN((STP) 
and  NN(S)  potential  surfaces  have  been  placed  on  deposit 
with  the  Electronic  Physics  Auxiliary  Publication  Service 
(EPAPS).2* 

To  obtain  the  forces  for  the  MD  simulations,  derivatives 
of  the  potential  have  been  determined  numerically.  It  may  be 
noted  that,  unless  otherwise  specified,  we  determine  potential 
energy  and/or  its  derivatives  for  a  given  configuration  using 
the  procedure  described  in  this  subsection. 

The  quality  of  fit  of  the  NN{STP)  surface  to  the  training 
set  is  shown  in  Fig.  1(a)*  It  shows  the  potential, 
V(ff;NN(STP)),  given  by  Eq,  (3),  versus  the  V(ab  initio) 
potential  for  all  5500  configurations  used  to  train  the  net¬ 
works.  A  similar  plot  for  the  remaining  1173  data  in  the 
testing  set  is  shown  in  Fig.  1(b),  Ideally,  the  points  in  such 
plots  would  lie  on  a  45°  line.  The  standard  deviation  of  the 
points  from  the  45°  line  is  a  measure  of  the  accuracy  of  the 
network.  Wc  find  that  a  rms  error  in  the  training  is  0.073  eV 
and  that  for  the  testing  set  is  0,078  eV,  which  is  within  the 
range  generally  regarded  as  “chemical  accuracy"  for  the  pur¬ 
poses  of  electronic  structure  calculations. 

Figures  2(a)  and  2(b)  show  analogous  plots  for  the 
NN(5)  adiabatic  potential  surface.  In  this  case,  the  rms  inter¬ 
polation  errors  for  the  training  and  testing  sets  arc  0.058  and 
0.078  eV,  respectively. 

C.  Molecular  dynamics  simulations 

The  Runge-Kutta  method  is  used  to  integrate  Hamilton's 
equations  of  motion.^'*1  An  integration  time  step  of  0.05  t.u. 
is  used  (1  t,u.=  1.018  X  IQ"14  s).  With  this  time  step  size,  the 
energy  is  conserved  to  an  accuracy  of  better  than  10-1  cV.  To 
study  dissociation  of  Si02  at  an  internal  energy  E,  a  set  of 
500  trajectories  is  run.  The  initial  configuration  is  obtained 
by  randomly  distributing  this  energy  E  into  three  normal 
modes  of  vibration.  For  the  initial  position  coordinates,  wc 


FIG.  2.  The  potential  energy  in  eV  given  by  the  NN(S)  neural  network  vs 
Ihe  corresponding  ab  miiia  potential  energy  for  the  data  used  for  (a)  I  raining 
and  (b)  testing. 

have  taken  the  equilibrium  configuration  of  SiG2,  and  the 
momenta  are  assigned  to  three  atoms  according  to  the 
method  based  on  normal  mode  vectors,  as  described  in  Ref. 
31,  The  normal  mode  vectors  for  this  purpose  have  been 
computed  using  GAUSSIAN  03,  The  use  of  this  method  ensures 
that  the  initial  states  for  the  study  of  the  decomposition  re¬ 
actions  hayt  a  microcanonical  distribution  of  the  internal  en¬ 
ergy  and  contain  no  linear  or  angular  momentum.  A  trajec¬ 
tory  is  allowed  to  run  for  40  000  time  steps;  if  dissociation 
occurs  earlier,  the  trajectory  is  terminated. 

111.  RESULTS  AND  DISCUSSION 
A,  Energetics  and  geometry 

Unless  otherwise  specified,  the  results  reported  in  this 
paper  refer  to  the  NN(STP)  surface. 

It  is  well  known  that  the  geometry  of  the  Si02  molecule 
in  the  gas  phase,  such  as  C02,  is  linear/2’14  Here,  we  noted 
the  same  and  found  that  the  equilibrium  Si-O  bond  length  is 
1,52  A.  Chu  et  al.n  performed  ah  initio  calculations  on  this 
system  at  the  density  functional  theory  (DFT)  level  using  a 
3-2 10  basis  set  for  silicon  and  6-3 1G*  basis  set  for  the  oxy¬ 
gen  atoms.  They  found  this  value  to  be  1.54  A.  Using  a 
molecular  orbital  approach  and  the  generalized  gradient  ap¬ 
proximation  in  the  density  functional  theory,  Nayak  et  ai  1 
determined  that  the  equilibrium  value  of  Si-0  bond  length  in 
Si02  is  1.53  A.  Further,  in  the  present  studies,  the  Si-O  bond 
length  in  SiO  is  found  to  be  equal  to  1.52  A. 

To  obtain  the  barrier  height  or  dissociation  energy  of  the 
reaction  given  by  Eq.  (1),  we  checked  and  noted  that  the 
minimum-energy  path  follows  the  linear  geometry.  i.e„  for  a 
given  value  of  Ry  and  ff2,  the  energy  is  minimum  when 

+  Therefore,  in  the  linear  geometry  for  a  fixed  value 
of  ff2,  we  have  varied  Ry  to  obtain  the  minimum  energy.  The 
variation  of  this  minimum  energy  with  Rj  on  the  NN(STP) 
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FIG,  3.  The  potential  energy  of  SiO:  as  a  function  of  on  the  NNfSTP) 
surface  corresponding  to  the  minimum  reaction  path,  as  described  in  the  lexi 
for  the  reaction  given  by  Eq,  (7). 

potential  surface  is  shown  in  Fig.  3.  From  this  curve,  we  find 
that  the  barrier  height  for  this  reaction  is  4.1  eV. 

The  reaction  given  by  Eq.  (2)  has  been  studied  in  two 
steps.  In  step  l*  for  a  fixed  value  of  0,  we  varied  Rt.  keeping 
R2  equal  to  /?h  to  obtain  the  minimum-energy  configuration. 
The  value  of  minimum  energy  so  obtained  as  a  function  of  6 
is  shown  in  Fig.  4.  Here,  we  note  an  existence  of  a  shallow 
minimum  at  0-55°.  ff|,  /?2,  and  Ry  values  at  this  minimum 
are  1.67,  1.67,  and  1.54  A,  respectively.  Thus,  we  infer  that 
though  Si02  is  most  stable  in  the  linear  geometry,  it  also  has 
a  triangular  mctastablc  state  which  is  2.2  eV  higher  than  the 
most  stable  linear  state.  The  barrier  height  for  the  transition 
out  of  this  mctastablc  state  is  about  0.6  eV. 

In  step  II,  wc  monitor  the  minimum-energy  path  on  the 
NN(STP)  surface  for  the  transition  from  the  triangular  Si02 
state  to  the  products  Si  +  02.  To  do  so,  in  the  nonlinear  ge¬ 
ometry  For  a  given  value  of  wc  vai7  Rj  to  a 

minimum  in  the  energy  versus  R |  curve.  The  minimum  en¬ 
ergy  so  obtained  as  a  function  of  R i  is  shown  in  Fig.  5,  From 
this  curve,  we  note  that  the  minima  at  /?j  =  /?2=1.67  A  cor¬ 
respond  to  the  mctastablc  state  obtained  in  step  L  The  energy 
difference  between  this  minimum  and  the  state  with  large 
R\(~Ri)  given  by  Fig.  5  is  equal  to  4.2  eV.  It  has  been  found 
that  the  value  of  Ry  corresponding  to  such  a  large  R j  is 
1.22  A,  As  expected,  this  value  of  Ry  is  in  agreement  with 
the  experimental  value  {=1.207  A)  {Ref.  35)  for  the  equilib¬ 
rium  bond  length  of  02,  Some  of  the  above  results  can  be 
expressed  succinctly  as  follows: 


FIG  4  The  potential  energy  of  SiOi  as  u  tune  non  of  0  comcKponding  in  the 
minimum  reaction  path  on  the  NNfSTP)  surface,  as  described  in  the  text  for 
the  reaction  given  by  Eq.  tS) 


FIG.  5,  The  potential  energy  of  SiCK  as  a  function  of  Rx  corresponding  to 
the  minimum  reaction  path  on  the  NNfSTP)  surface,  as  described  in  the  texi 
for  the  reaction  given  by  Eq.  (9L 

Si02  +  £,  =  SiO  +  O, 

(7) 

Si02(  1  inear)  +  £2  -  Si 02  ( triangu  1  ar) , 

(8) 

Si02(lriangular)  +  Ey  =  Si  +  02, 

(9) 

where  E|=4.1  eV,  E2- 2.2  eV,  and  £^=4,2  eV.  Combining 
Eqs,  (8)  and  (9),  we  obtain 

SiOjOinear)  +  £4  =  81  +  02.  ( 1 0) 

where  £4 =6.4  eV,  The  above  barrier  heights  and  energies 
have  been  obtained  without  any  consideration  for  the  small 
corrections  due  to  zero- point  energy. 

It  may  be  noted  that  the  value  of  E(  =4. 1  eV  given  by  the 
NN(STP)  and  ab  initio  data  in  our  present  studies  is  very 
close  to  4.013  eV  given  directly  by  our  ab  initio  data  without 
the  use  of  NN  fitting.  Lu  et  ai}A  also  performed  similar 
B3LYP/6-31G(r/)  ab  initio  calculations  using  the  GAMESS 
(Ref.  36)  code  and  found  the  value  of  E}  =  3.953  eV.  Simi¬ 
larly,  against  £4=6.4  eV  given  by  the  NN(STP}  surface  in 
the  present  studies,  we  obtained  £4 =6.432  eV  directly  from 
the  ab  initio  data  and  Lu  et  af,  M  got  £4=6 JtO  eV  using  the 
gamess  software.  These  results  show  that  the  error  associ¬ 
ated  with  the  NN  fitting  is  comparable  to  the  inaccuracy 
associated  with  the  ah  initio  data.  The  differences  in  the  ah 
initio  results  may  be  due  to  small  differences  in  the  numeri¬ 
cal  procedures  in  the  GAMESS  and  GAUSSIAN  codes. 

As  expected,  the  value  of  £2=2.2  eV  given  by  the 
present  studies  is  also  in  good  agreement  with  2,108  eV 
computed  by  Lu  et  ai.u  using  the  GAMESS  code  at 
B3LYP/6-3  IG{d)  level.  The  Si-0  equilibrium  bond  length 
of  L67  A  noted  in  the  present  studies  for  Si02  in  the  trian¬ 
gular  geometry  is  comparable  with  the  Si-O  bond  lengths 
computed  by  Chu  et  at.  for  various  large  clusters  of  oxides 
of  silicon:  Chu  et  atK  noted  that  the  large  clusters  of  oxides 
of  silicon  are  composed  of  two  buckled  rings  where  the  Si-0 
bond  length  ranges  from  1 .67  to  1 .96  A ;  they  also  noted  that 
the  value  of  the  Si-O  bond  length  in  various  Sh04  rings  in 
the  SinOm  clusters  with  m&n+ 2  is  1,66  A, 

In  regard  to  comparison  with  the  experiment,  the  values 
of  £4  given  by  the  present  studies  as  well  as  by  Lu  et  alu 
differ  from  the  experimental  value17  of  6.769  cV  by  -“5%. 
For  the  binding  energy  of  SiO,  we  note  a  difference  of  —5% 
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FIG.  6,  The  vanaiian  of  \n{NJN„)  as  a  fundion  of  lime  ( t )  in  picoseconds  at 
£=4,5  eV  computed  on  the  NN(STP)  surface. 

from  the  experiment.  Wc  obtain  a  value  of  7,828  cV  for  this 
quantity  in  good  agreement  with  7.738  eV  obtained  by  Lu  el 
al.u  using  the  gamess  code  at  B3LYP/6-3lG(d)  level.  The 
reported  experimental  value1  is  8.237  eV. 

It  may  be  interesting  to  note  the  spin  states  of  the  sys¬ 
tems  studied  here.  Wc  see  that  in  Eq+  (7),  SiQ2  is  in  a  singlet 
state  but  the  products  of  this  reaction  are  in  the  triplet  state. 
In  Eq,  (8),  Si02  on  both  sides  is  in  a  singlet  state,  but  in  Eq. 
(9)  or  (10),  the  Si+02  system  is  in  a  pentet  state.  If  we 
confine  our  studies  to  an  adiabatic  reaction  on  the  NN($) 
surface,  the  dissociation  energy  for  the  reaction  given  by  Eq. 
(1)  comes  out  to  be  6.8  cV  instead  of  4.1  eV. 


B*  Trajectories 

Trajectories  have  been  computed  on  both  the  NM(STP) 
and  NN(S)  surfaces  using  the  procedure  described  in  Sec, 
II  C  to  study  the  dissociation  reaction  to  SiO+O  products,  A 
typical  decay  ploL  for  dissociation  on  the  NN(STP)  surface  at 
vibrational  energy  £=4.5  cV  is  shown  in  Fig,  6.  The  near 
linearity  of  the  result  shows  that  the  dissociation  process  fol¬ 
lows  first-order  kinetics,  Le.» 

N  =  NfIcxp(~kl)t  (II) 

where  N  is  the  number  of  imdissociated  trajectories  at  lime  t 
and  Nn  is  the  value  of  N  at  f-0.  As  mentioned  earlier,  we 
have  taken  /Vfl=500,  It  may  be  mentioned  that  a  typical  tra¬ 
jectory  running  for  1500  time  steps  takes  about  1  s  of  CPU 
time  on  our  1.4  GHz  machine. 

From  the  slope  of  the  decay  plot  we  obtain  the  dissocia¬ 
tion  rate  coefficient,  k .  The  rate  coefficients  for  a  few  values 
of  E  in  the  range  of  4. 3 -5.0  cV  have  been  computed  with 
the  WN(STP)  surface.  These  coefficients  lie  in  the  range  from 
0,304  to  4,274  p $T].  If  the  reaction  is  assumed  to  occur  adia- 
batically  on  the  singlet  NN(S)  surface,  the  corresponding  rate 
coefficients  are  much  smaller  due  to  the  reaction  e  n  dot  her- 
m icily  being  2,7  cV  larger  for  the  process.  All  results  are 
given  in  Tabic  N. 

The  dependences  of  dissociation  rate  coefficients  on  en¬ 
ergy  E  for  both  the  NN(STP)  and  NN(S)  surfaces  are  shown 
in  Fig.  7.  This  plot  shows  that  the  dissociation  rales  follow 
the  R  ice- Rams  pc  rger- Kassel  (RRK)  equation*™ 
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TABLE  II.  Reaction  rate  coefficient  (A)  for  the  reaction  SiO?=SiO+0. 


Energy  (eV) 

k  (ps*1)  NN(S) 

k  (ps'1)  NNISTP) 

4,3 

... 

0.304 

4.4 

... 

0.548 

4.5 

... 

0.867 

4.6 

... 

1.123 

4.8 

... 

2,266 

5.0 

4.274 

7.0 

0.0314 

... 

72 

0.225 

7,4 

0467 

7,5 

0.6 18 

8.0 

1.456 

B.5 

2.397 

k=f{\  -E/E„y-'.  (12) 

where  E0  is  the  critical  energy  that  can  be  taken  equal  to  the 
barrier  height  of  the  dissociation  reaction,  j  is  a  parameter 
that  gives  the  number  of  vibrational  degrees  of  freedom  par¬ 
ticipating  in  the  reaction*  and  /  is  the  frequency  factor  or 
high-energy  limit  of  the  rate  coefficient.  For  this  plot,  we 
have  taken  E„=4, 1  and  6.8  eV  for  the  NN(STP)  and  NN(S) 
surfaces*  respectively.  A  least-squares  fit  of  the  data  shown  in 
Fig,  7  yields  corresponding  s  values  of  2,90  and  3. 1 9  and  / 
values  of  92.4  ps“'  (3080  cm-1)  and  97.6  ps"1  (3253  cm"1). 
These  values  of  s  arc  in  good  agreement  with  Lhe  number  of 
vibrational  degrees  of  freedom  participating  in  the  reaction, 
which  is  equal  to  three  due  to  a  rapid  energy  transfer  among 
all  the  three  modes.  The  computed  frequency  factors  /  are 
nearly  twice  the  asymmetric  stretching  frequency 
(1449.8  cm^1)  of  Si02  given  by  the  Gaussian  calculation  at 
the  B3LYP/6-31G*  level.  The  factor  of  2  is  easy  to  under¬ 
stand  from  the  fact  that  Si02  has  two  oxygen  atoms  and  the 
reaction  given  by  Eq.  ( l )  can  take  place  in  two  ways. 

To  investigate  the  dissociation  of  Si02  into  Si+02,  the 
vibrational  energy  must  be  in  excess  of  6.4  eV  [see  Eq.  (10)]. 
Therefore,  we  inserted  a  total  vibrational  energy  £  as  high  as 
8.0  eV  randomly  in  all  modes  and  computed  more  than  2000 
trajectories  but  all  trajectories  are  found  to  lead  to  dissocia¬ 
tion  to  SiO+O,  We  also  put  the  same  total  energy  in  a  single 


FIG-  7,  RRK  pkti  for  lhe  dissociation  rule  k  in  ps-1  for  ihc  dissociation  of 
Si02  inio  SiO+O.  The  circles  «ind  solid  line  correspond  to  the  date  obtained 
using  lhe  NN(STP)  potential  wiih  £„=4.1  eV.  The  triangles  and  the  dotted 
line  correspond  lo  Ihc  data  ohi  timed  using  lhe  NN(S)  potential  surface  with 
£„^6.g  eV. 


Downloaded  19  Jul  2006  to  139.76  79,49  Redistribution  subject  1o  AIP  license  or  copyright,  see  hltp:tfjcp.atp,org/jcp/copyright.jsp 


1 34306-6  Agrawal  e\  ai 


221 

J.  Chem.  Rhys.  124,  134306  (20061 


•  3 

- : 

•  f 

•  % 

(a) 

(h) 

<e> 

• 

- 

•  % 

m 

(*) 

(4) 

% 

4 

m 

u> 

#  (h> 

•  ® 

RG,  K  The  local  ion  of  the  moms  during  ihe  ciimbinttiion-dissocmtion  re* 
action  given  by  Fq.  ( 14)  for  a  typical  trajectory  at  time  steps:  (a)  lT  (b)  200. 
(c)  400,  Id)  500,  (e)  6CX),  if)  800,  (g)  1000,  (h)  1200,  and  ti)  1600;  I  time 
step =0,509  fs.  Each  of  the  nine  boxes  shown  here  is  of  size  12  X  12  A  in  the 
plane  in  which  the  reaction  occurs. 

bending  mode  but  we  could  not  get  Si+02.  This  may  be  due 
to  the  fact  that  the  SiO+O  channel  has  a  veiy  low  barrier 
height  relative  to  that  for  the  Si+02  channel.  Therefore,  the 
dissociation  into  SiO+O  occurs  with  very  high  probability, 
precluding  the  formation  of  Si+03  products. 

To  test  the  ability  of  the  KN(STP)  potential  energy  sur¬ 
face  to  properly  represent  the  reaction  channel  leading  to 
Si+02  products,  we  investigated  the  reaction  given  by  Eq. 
(2)  in  the  back  direction,  i,e„ 

Si  +  02->Si02*  (13) 

We  have  found  that  although  Si02  is  formed  in  the  collision 
of  Si  with  Oj,  in  all  cases  Si02  immediately  dissociates  into 
SiO  +  O.  Figure  8  depicts  a  typical  trajectory  undergoing 
such  a  combination  and  dissociation  process,  Le., 

Si  +  Oz  Si02  SiO  +  O.  (14) 

In  this  trajectory,  we  started  with  the  initial  configuration  as 
shown  in  Fig.  8(a)  and  imparted  a  small  relative  velocity  to 
Si  atom  in  the  direction  of  the  center  of  mass  of  02,  Wc  also 
initially  inserted  small  vibrational  and  rotational  energies 
into  the  G2  molecule.  Figure  9  shows  the  temporal  behavior 
of  the  interatomic  distances  in  the  trajectory.  Initially,  R j  and 


FIG  9.  The  variation  of  R j,  and  Rx  tin  A)  as  a  function  of  time  r  for  a 
typical  trajectory  undergoing  a  combination- dissociation  reaction  given  by 
Eq.  f 141, 


FIG.  10.  Variation  of  the  four  potentials  along  the  $iG2— *SiG+Q  reaction 
coordinate.  The  results  for  Ihe  singlet,  triplet,  pentet,  and  minimum -energy 
reaction  pathways  are  labeled  in  the  figure  as  NN(S),  NN(T),  NN(P),  and 
NN(STP),  respectively.  The  crossing  point  of  the  NNfS)  and  NNfSTP)  po¬ 
tentials  is  seen  at  a  Si-G  distance  of  2.1  A.  The  energy  at  the  crossing  point 
relative  to  that  for  equilibrium  Si02  is  3.5  eV. 

R2  arc  large  while  is  small.  At  j— 0.35  ps,  fl,  and  R2  both 
become  small  and  R$  starts  increasing,  i.c.,  two  Si-0  bonds 
tend  to  be  formed  and  the  0-0  bond  begins  to  break.  Fur¬ 
thermore,  we  note  that  very  soon  the  system  appears  as 
SiO+O  (demonstrated  by  a  small  value  of  /?,  and  large  val¬ 
ues  of  R2  and  /f3).  This  trajectory  thus  reveals  that  the  po¬ 
tential  is  capable  of  representing  scission  of  the  0-0  bond 
and  formation  of  Si-O  bonds.  However,  the  large  exoiher- 
micity  of  the  recombination  process  causes  the  newly  formed 
Si02  molecule  to  undergo  rapid  dissociation  to  SiO+O, 

IV.  ADIABATIC/NON  ADIABATIC  PROCESSES 

The  investigations  of  the  Si02  dynamics  reported  here 
represent  two  limiting  cases.  If  dissociation  to  SiO+O  occurs 
adiabahcaliy  on  the  singlet  surface  represented  by  NN(S),  it 
will  be  a  high-energy  process  with  a  threshold  slightly  below 
6.8  eV  when  zero-point  energy  effects  are  considered.  If. 
however,  the  dissociation  follows  the  lowcst-cncrgy  pathway 
leading  from  Si02  to  SiO+G,  dissociation  can  occur  at  sig¬ 
nificantly  lower  internal  energies  with  a  threshold  around 
4,0  eV,  In  addition  to  these  two  limiting  cases,  there  exist  a 
very  large  number  of  other  possible  pathways  that  involve 
transitions  between  the  singlet  and  triplet  surfaces  aL  points 
other  than  the  crossing  points.  Such  pathways  might  be  in¬ 
vestigated  using  a  surface  hopping  procedure  such  as  that 
suggested  by  TuIIy.3<? 

A  simple  thermodynamic  argument  suggests  that  the 
minimum  pathway  mechanism  represented  by  the  NN(STP) 
surface  would  be  favored.  However,  such  a  view  ignores  the 
fact  that  the  system  must  be  able  to  undergo  a  change  in  spin 
state  in  the  time  available  when  the  singlcMriplcl  seam  is 
reached*  If  the  spin -orb it  and  nonadiabatic  coupling  terms 
are  not  sufficiently  strong  to  induce  such  a  multiplicity 
change,  the  dissociation  process  might  pass  through  the  scam 
without  a  change  of  spin  state  and  continue  on  the  adiabatic 
singlet  surface.  In  the  absence  of  a  careful  computation  of 
these  coupling  terms  in  a  surface  hopping  investigation.  1  wc 
cannot  definitively  answer  these  questions  related  to  the  Si02 
dissociation  mechanism. 
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One  factor  that  favors  a  near  minimum-energy  paLhway 
mechanism  is  the  location  of  the  singlet-triplet  crossing  point 
along  the  reaction  coordinate  for  dissociation  to  SiO+O. 
This  is  shown  in  Fig.  10.  This  figure  shows  the  computed  ab 
initio  energies  for  the  singlet,  triplet,  and  pentet  states  along 
the  dissociation  coordinate,  The  NN  fits,  NN(S),  NN(T),  and 
NN(P),  are  shown  as  points.  The  minimum-energy  path  on 
the  NN(STP)  surface  is  the  solid  curve.  The  crossing  point 
between  the  NN(T)  and  NN(S)  surfaces  occurs  at  an  energy 
of  3.5  eV  above  the  Si02  minimum.  Therefore,  dissociating 
trajectories  with  a  total  internal  energy  of  4.3  eV  will  have 
only  0,8  eV  of  internal  vibrational  energy  remaining  when 
the  seam  is  reached.  This  energy  will  be  distributed  through¬ 
out  three  vibrational  degrees  of  freedom.  Consequently,  we 
would  expect  the  energy  remaining  in  the  reaction  coordinate 
to  be  about  0,3  eV.  The  relatively  heavy  silicon  and  oxygen 
atoms  will  not  be  moving  very  fast  at  such  energies.  Under 
these  conditions,  we  might  expect  the  system  to  remain  in 
the  region  of  the  crossing  point  for  a  sufficient  lime  to  permit 
spin-orbit  and  other  adiabatic  coupling  terms  to  induce  a  spin 
transition  to  Lhe  triplet  state. 

A  more  quantitative  evaluation  of  the  extent  of  surface 
hopping  in  this  system  will  require  a  careful  computation  of 
the  spin-orbit  coupling  and  transition  probabilities. 


V.  SUMMARY  AND  CONCLUSIONS 

Ah  initio  potential  energy  values  for  Si02  have  been 
computed  in  3753  configurations  using  GAUSSIAN  03  at  the 
R3LYP/6-31G*  level  of  theory.  At  each  configuration,  the 
energy  values  arc  calculated  for  singlet,  triplet,  and  pen  tel 
spin  states.  For  further  use,  out  of  these  three  values,  we 
picked  that  value  of  energy  which  is  minimum  to  create  a 
fourth  database.  By  discarding  highly  repulsive  configura¬ 
tions  and  by  adding  some  configurations  based  on  the  sym¬ 
metry  of  the  molecule,  we  obtained  ah  initio  data  points  to 
interpolate  for  each  of  the  four  databases. 

Two-layer  (3-40-1)  neural  networks  are  used  to  interpo¬ 
late  between  the  points  in  each  database.  The  NN  surfaces 
thus  obtained  are  denoted  NN(S),  NN(T),  NN(P),  and 
NN(5TP)  for  the  singlet,  triplet,  pentet,  and  minimum-energy 
databases,  respectively.  Trajectories  have  been  computed  us¬ 
ing  the  interpolated  data  provided  by  the  NN{S)  and 
NN(STP)  surfaces.  Thus,  the  method  is  a  self- starting  one 
which  does  not  require  the  initial  use  of  an  empirical  poten¬ 
tial,  The  following  results  obtained  in  this  study  show  that 
the  NN  interpolation  process  is  very  good, 

(1)  The  rms  interpolation  error  on  the  NN(STP)  surface 
associated  with  the  training  set  is  0,073  eV  and  that 
with  the  testing  set  is  0.078  cV.  This  error  is  within  the 
range  generally  regarded  as  "chemical  accuracy11  for 
the  purposes  of  electronic  structure  calculations.  For 
the  NN(S)  surface,  the  corresponding  rms  errors  are 
0.058  and  0.078  eV. 

(2)  The  difference  in  the  binding  energy  or  dissociation 
energy  of  SiO  as  well  as  SiG2  between  the  direct  ah 
initio  results  and  those  given  by  the  NNs  is  less  than 
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0J  cVr  This  difference  is  comparable  with  that  given 
by  the  differences  in  the  values  given  by  GAUSSIAN  oa 
and  GAMES5. 

(3)  The  dissociation  rate  coefficients  for  the  reaction, 
Si02-*5iO+0,  follow  the  RRK  equation  [see  Eq. 
(12)].  The  values  of  s  and  /  parameters  of  the  RRK 
equation  for  this  reaction  arc  found  to  be  close  lo  the 
expected  values  based  on  the  number  of  degrees  of 
freedom  participating  in  the  dissociation  process,  the 
vibrational  frequency  of  the  asymmetric  stretching 
mode,  and  the  number  of  possible  ways  in  which  the 
molecule  can  dissociate.  This  is  the  case  whether  the 
reaction  occurs  adiabatically  on  the  NN(S)  surface  or 
follows  a  minimum-energy  path  on  the  NN(STP) 
surface. 

This  study  demonstrates  that  the  spin  multiplicity  can  be 
appropriately  and  easily  considered  while  using  a  NN  to  in¬ 
terpolate  the  ah  initio  data  to  run  MD  simulations. 

As  regards  geometry  of  Si02,  we  have  found  that  the 
most  stable  state  is  linear  with  a  Si-0  bond  length  of  1.52  A. 
This  value  is  in  good  agreement  with  that  computed  by  Chu 
et  ain  as  well  as  Nayak  et  al.y*  Further,  for  a  Si-0  bond 
length  in  SiO,  we  get  the  same  value  (1.52  A).  For  the  dis¬ 
sociation  energy  of  SiO,  and  Si02  (into  Si+02),  wc  found 
that  the  present  results  are  in  very  good  agreement  with  those 
obtained  by  Lu  et  a/.34  using  GAMESS.  However,  these  energy 
values  arc  found  to  be  about  5%  lower  than  the  experimental 
data.37  In  agreement  with  the  previous  results  of  Lu  et  o/.,u 
we  have  seen  that  Si02  exists  in  a  triangular  state  which  is 
2. 1  eV  higher  than  the  most  stable  linear  configuration.  Wc 
further  note  that  the  barrier  height  to  exit  from  this  triangular 
stale  to  lhe  linear  state  is  0.6  eV.  The  Si-Q  bond  length  in 
this  triangular  Si02  state  is  L67  A,  which  is  in  good  accord 
with  the  value  of  L66  A  noted  by  Chu  et  for  the  Si-0 
bond  length  in  various  Si204  rings  in  the  Si„Om  clusters  with 
m  ^/t  +  2. 

Dissociation  processes  on  the  NN(S)  and  NN(STP)  sur¬ 
faces  represent  two  limiting  cases.  The  Former  is  a  high- 
energy  dissociation,  whereas  the  latter  describes  dissociation 
along  a  minimum-energy  pathway  between  Si02  and  SiO 
+0  that  requires  a  spin-state  transition  at  the  crossing  points 
between  singlet  and  triplet  surfaces.  In  addition  to  these  two 
limiting  cases,  there  exist  a  very  large  number  of  other  pos¬ 
sible  pathways  that  involve  transitions  between  the  singlet 
and  triplet  surfaces  at  points  other  than  the  crossing  points. 
It  is  suggested  that  both  thermodynamic  considerations 
and  the  energies  at  the  singlet-triplet  crossing  points  favor 
reaction  along  the  minimum-energy  pathway.  The  crossing 
points  occur  far  along  the  reaction  coordinate  near  the  prod¬ 
uct  valley  of  the  surface.  This  means  the  energy  remaining  in 
motion  along  the  reaction  coordinate  will  be  small.  This 
point  and  the  fact  that  the  dissociation  coordinate  involves 
the  relative  motion  of  heavy  Si  and  O  atoms  suggest  that  the 
system  will  remain  in  the  region  of  the  crossing  scam  for  a 
sufficient  lime  to  permit  spin-orbit  and  adiabatic  couplings  to 
induce  a  spin  transition. 


Downloaded  19  Jul  2006  lo  139.78.79.49.  Redistribution  subject  to  AIR  license  or  copyright,  see  ht1p://jCp.aip.org/jcp/copyrighl  jsp 


134306-8  Agrawal  el  al. 

ACKNOWLEDGMENTS 

This  project  is  funded  by  grants  from  the  National  Sci¬ 
ence  Foundation  (DM  1-0200327  and  DM1-0457663).  We 
thank  Dr.  W.  DeVries,  Dr,  G.  Haxelrigg,  Dr,  J,  Cao.  and  Dr. 
D.  Durham  of  the  Division  of  Design,  Manufacturing,  and 
Industrial  Innovation,  Dr.  B.  M,  Kramer,  Engineering  Cen¬ 
ters  Division,  and  Dr,  J.  Larsen  Basse,  Tribology  and  Surface 
Engineering  program  for  their  interest  and  support  of  this 
work.  This  project  was  also  sponsored  in  part  by  a  DEPS- 
CoR  grant  on  the  Mulliscale  Modeling  and  Simulation  of 
Material  Processing  (F49620-03- 1  -028 1 ).  The  authors  thank 
Dr.  Craig  S.  Hartley  and  Dr.  J.  Tiley,  Program  Managers  of 
the  Metallic  Materials  Program,  of  the  Air  Force  Office  or 
Scientific  Research  (AFOSR)  for  their  interest  in  and  support 
of  this  work.  One  of  the  authors  (R.K.)  also  thanks  A.  H. 
Nelson,  Jr.,  endowed  chair  in  engineering  for  additional 
support, 

'  L.  M.  Raff.  M.  Malshe,  M,  Hagan,  D.  1.  Doughan,  M.  G.  Rockley.  and  R. 
Komunduri,  J.  Chem.  Phys.  122,  084 1 04  (2005). 

*D.  I.  Doughan .  L  M  Raff.  M.  G.  Rockley.  M,  Hagan,  P.  M,  Agrawal* 
and  R.  Komamluri,  J-  Chem,  Phys  124.  054321  (2006). 

JS.  Loren/,  A  GroB.  and  M  Scheffter,  Chem.  Phys.  Leu.  395,  210 
(2004). 

4D  F  R.  Brown,  M  N.  Gibbs,  and  D.  C.  Gary,  J,  Chem.  Phys.  105,  7597 
(1996k 

*T  B.  Blank.  S.  D.  Brown,  A,  W,  Calhoun,  and  D  i.  Dorcn,  J,  Chem, 
Phys.  103.4129(1995) 

ftJ  Ischtwan  and  M  A.  Collins.  1.  Chem,  Phys.  100.  8080(1994). 

7  k.  A  Nguyen.  1.  Rossi,  and  D  G,  Trnhlar,  l  Chem  Phys,  103.  5522 
(1995). 

aT.  Ishida  and  G.  C.  Schaiz,  J  Chem,  Phys.  107.  3558  (1997k 

*K.  C.  Thompson,  M.J.T,  Jordan,  and  M,  A.  Collins,  J,  Chem.  Phys.  108, 

8302  (1998).  *  to 

mL  A  Pederson,  G  C  Schaiz.  T.-S.  Ho.  T  Hollebeek*  H.  Rabitz*  L.  B. 

Harding,  and  G.  Lendvay,  j.  Chem  Phys,  110,  9091  (1999). 

"  R  P  A  Bellens  and  M  A  Collins,  J  Chem,  Phys.  HI.  816  (1999), 
t:R.  P  A  Belle  ns.  M  J  T.  Jordan.  D.  H.  Zhang,  and  M,  A.  Collins,  J 
Chem.  Phys.  112,  10162  (2000). 

,JD  H  Zhang,  M.  A,  Collins*  and  S  -Y.  Lee.  Science  290,  961  LOGO). 
tdL  A  Pederson*  G,  C,  Sehalz,  T.  Hollcbeek,  T.-S.  Ho.  H.  Rabitz,  and  L. 
B  Harding,  J.  Phys.  Chem,  A  104.  2301  (2000k 


223 

J  Chem.  Phys.  124.  134306  (2006) 

15  K.  Song  and  M  A.  Collins,  Chem.  Phys.  Let<  335,  481  (2001) 

*G,  E.  Moyano  and  M,  A,  Collins,  J  Chem  Phys  119.  5510  (2003), 

17  M,  A.  Collins  and  L.  R adorn.  J.  Chem.  Phys.  118*  6222  (2003). 

,HR.  Z  Paseunl.  G.  C.  Schatz.  G.  Lendvuy,  and  D.  Troya,  J.  Phys  Chem  A 
106.4125  (2002). 

|1JM.  A,  Collins.  Theor.  Chem.  Acc.  108.  313  (2002), 

»C.  Crespos*  M  A.  Collins,  E.  Pijper,  and  G.  J.  Kroes,  Chem.  Phys.  Lull, 
376.  566  (2003). 

21 C,  Crespos,  M.  A.  Collins,  E.  Pijper,  and  G.  J.  Kroes,  J  Chem.  Phys. 

120*  2392  (2004).  „  ,  _  . 

22  G,  G,  Maisuradze,  D.  L,  Thompson,  A.  F.  Wagner,  and  M.  Mmkoff,  J, 
Chem.  Phys.  119*  10002  (2003). 

2JA  Kawano,  Y.  Guo,  D.  L,  Thompson,  A.  F  Wagner,  and  M.  Minkuff,  J. 
Chem.  Phys.  120*6414(2004)* 

»  Y  Guo.  A.  Kawano,  D.  L.  Thompson.  A,  F.  Wagner,  and  M.  Mmkoff.  J, 
Chem,  Phys.  121*5091  (2004), 

i.  Frisch.  G.  W.  Trucks,  H.  B  Schlcgd  et  al.  gaussian  Ok  Revision 
C.02,  Gaussian*  Inc,.  Wallingford.  CT,  2004. 

26  M.  T.  Hagan,  H.  B  Demuth,  and  M  Beale,  Neural  Newark  Design 
(PWS,  Boston.  MA,  1996), 

27  M  T.  Hagan  and  M.  Mcnhaj.  IEEE  Trans,  Neural  Netw.  5,  989  (1994) 
2*See  EPAPS  Document  No  E-JCPSA6-124-0Q86I 3.  This  document  can  be 

reached  via  direct  link  in  the  online  article ‘s  HTML  reference  section  or 
via  the  EPAPS  homepage  (hup://www.aip,OTg/pubservs/c paps.html). 

®M.  P.  Allen  and  D.  J  Tildesley  ,  Computer  Simulation  of  Liquids  (Clar¬ 
endon,  Oxford,  2002). 

WL.  M.  Raff  and  D,  L.  Thompson,  in  Theory  of  Chemical  Reaction  Dy* 
namics,  edited  by  M.  Baer  (CRC.  Boca  Raton.  FL,  1985),  Wit  111 

”  E.  B.  Wilson,  Jr..  S.  C.  Dec i us,  and  R  C.  Cross,  Molecular  Vibrations 
(Dover*  New  York,  1955). 

33  T.  5,  Chu.  R,  Q.  Zhang,  and  H,  F.  Cheung.  J.  Phys,  Chem.  B  105,  1705 

(2001). 

“S.  K,  Nayak,  B  K.  Rao*  S  N  Khanna.  and  P,  kna,  J.  Chem,  Phys,  I01*. 

1245  (1998).  „  ^  J 

MW  C.  Lu,  C.  Z.  Wang.  V.  Nguyen,  M.  W  Schmidt,  M,  S  Gordon,  and  K, 

M.  Ho,  J,  Phys,  Chem.  A  107.  6936  (2003)* 

35  CRC  Handbook  of  Chemistry  and  Physics,  edited  by  D,  K,  Lule  (CRC* 
Boca  Raton,  FL,  2004), 

WM.  W,  Schmidt,  K.  K,  Baldridge,  J.  A,  Boatz  et  al,  J.  Comput.  Chem 
14,  1347  (1993), 

31  NIST-JANAF  Thermochemical  Tables.  4th  ed.,  edited  by  M  W.  Chase.  Ir.. 
J.  Phys,  Chem.  Ref,  Data,  Monograph  No  9.  (National  Institute  t>r  Stan¬ 
dards  and  Technology,  Gaithersburg*  MD,  1998). 

58  K.  J.  Lai  die  r.  Theories  of  Chemical  Reaction  Rates  (McGraw-Hill.  New 

York*  1969).  p.  119. 

JflJ  C.  Tally*  J.  Chem.  Phys.  93.  1061  (1990) 


Downloaded  19  Jttl  2006  to  139.78,79.49.  Redistribution  subject  to  AIP  license  or  copyright,  see  hltp^cp. alp.org/jcp/copynghl.jsp 


